minimax/voice-cloning
About
Clone voices for use with Minimax's speech-02-hd and speech-02-turbo text-to-speech models.

Example Output
{"model":"speech-02-turbo","preview":"https://replicate.delivery/xezq/p80hlWW4YWptBh3YGnNEDmR8ldh9QQDCxZNrICRge2HgT9UKA/tmpuo0ipa91.mp3","voice_id":"R8_FDU1SV5S"}
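The example output is a JSON object with three fields: `model`, `preview`, and `voice_id`. A minimal sketch of reading it in Python follows, assuming the output arrives as a JSON string shaped like the example above. The helper name `parse_clone_output` is hypothetical and not part of any library.

```python
import json

# Example output copied verbatim from this page.
EXAMPLE_OUTPUT = (
    '{"model":"speech-02-turbo",'
    '"preview":"https://replicate.delivery/xezq/p80hlWW4YWptBh3YGnNEDmR8ldh9QQDCxZNrICRge2HgT9UKA/tmpuo0ipa91.mp3",'
    '"voice_id":"R8_FDU1SV5S"}'
)


def parse_clone_output(raw):
    """Return (voice_id, preview_url, model) from a voice-cloning output string.

    Hypothetical helper: it assumes the three fields seen in the example output.
    """
    data = json.loads(raw)
    missing = [key for key in ("model", "preview", "voice_id") if key not in data]
    if missing:
        raise ValueError(f"output is missing fields: {missing}")
    return data["voice_id"], data["preview"], data["model"]


voice_id, preview_url, model = parse_clone_output(EXAMPLE_OUTPUT)
print(voice_id)  # prints R8_FDU1SV5S
```

The `voice_id` identifies the cloned voice, and `preview` links to an MP3 sample generated with the model named in `model`.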
Performance Metrics
- Prediction Time: 34.10s
- Total Time: 34.12s
All Input Parameters
{ "model": "speech-02-turbo", "accuracy": 0.7, "voice_file": "https://replicate.delivery/czjl/21U5IFboRwrhBlKks9pmaz119Hvo1ISryE0LNUKuerpqS9UKA/output.wav", "need_noise_reduction": false, "need_volume_normalization": false }
Input Parameters
- model: The text-to-speech model to train.
- accuracy: Text validation accuracy threshold (0-1).
- voice_file (required): Voice file to clone. Must be MP3, M4A, or WAV format, 10s to 5min duration, and less than 20MB.
- need_noise_reduction: Enable noise reduction. Use this if the voice file has background noise.
- need_volume_normalization: Enable volume normalization.
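The parameters above can be assembled into an input payload before a run is submitted. The sketch below is illustrative only: `build_clone_input` and `clone_voice` are hypothetical helper names, the default values are copied from the example input on this page (they are not confirmed to be the model's own defaults), and only the file extension and the accuracy range are checked. Duration and file size are not verified here. The call itself assumes the third-party `replicate` Python client is installed and `REPLICATE_API_TOKEN` is set.

```python
import os

# Formats listed for voice_file on this page.
ALLOWED_EXTENSIONS = {".mp3", ".m4a", ".wav"}


def build_clone_input(voice_file, model="speech-02-turbo", accuracy=0.7,
                      need_noise_reduction=False,
                      need_volume_normalization=False):
    """Build the input dict for a voice-cloning run (hypothetical helper).

    Defaults mirror the example input on this page, not documented defaults.
    """
    # Ignore any query string when reading the extension from a URL.
    extension = os.path.splitext(voice_file.split("?")[0])[1].lower()
    if extension not in ALLOWED_EXTENSIONS:
        raise ValueError(f"voice_file must be MP3, M4A, or WAV, got {extension!r}")
    if not 0 <= accuracy <= 1:
        raise ValueError("accuracy must be between 0 and 1")
    return {
        "model": model,
        "accuracy": accuracy,
        "voice_file": voice_file,
        "need_noise_reduction": need_noise_reduction,
        "need_volume_normalization": need_volume_normalization,
    }


def clone_voice(voice_file, **options):
    """Submit the run. Requires network access and an API token."""
    import replicate  # third-party client, imported lazily: pip install replicate
    return replicate.run("minimax/voice-cloning",
                         input=build_clone_input(voice_file, **options))


payload = build_clone_input(
    "https://replicate.delivery/czjl/21U5IFboRwrhBlKks9pmaz119Hvo1ISryE0LNUKuerpqS9UKA/output.wav"
)
print(payload["model"], payload["accuracy"])
```

Validating locally before submitting avoids spending a prediction of roughly half a minute on an input that would be rejected for its format.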
Output Schema
Example Execution Logs
Uploaded voice file in 2.10sec
Cloned voice in 18.48sec
Generating speech with model speech-02-turbo
Generated speech in 13.18sec
Voice cloned successfully with ID: R8_FDU1SV5S
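The timed log entries show how the prediction time splits across stages. A small sketch of that arithmetic, assuming each timed entry ends in the form "<stage> in <seconds>sec" as in the example logs (the function name `stage_timings` is hypothetical):

```python
import re

# Log entries copied from the example above, one entry per item.
LOG_LINES = [
    "Uploaded voice file in 2.10sec",
    "Cloned voice in 18.48sec",
    "Generating speech with model speech-02-turbo",
    "Generated speech in 13.18sec",
    "Voice cloned successfully with ID: R8_FDU1SV5S",
]

# Matches entries such as "Cloned voice in 18.48sec".
TIMED_ENTRY = re.compile(r"^(?P<stage>.+?) in (?P<seconds>\d+(?:\.\d+)?)sec$")


def stage_timings(lines):
    """Map each timed stage description to its duration in seconds."""
    timings = {}
    for line in lines:
        match = TIMED_ENTRY.match(line.strip())
        if match:
            timings[match.group("stage")] = float(match.group("seconds"))
    return timings


timings = stage_timings(LOG_LINES)
total = sum(timings.values())
print(f"{total:.2f}s across {len(timings)} timed stages")  # prints 33.76s across 3 timed stages
```

The three timed stages sum to 33.76s, which leaves about 0.34s of the 34.10s prediction time outside the logged stages. Cloning itself (18.48s) is the largest share, followed by generating the preview audio (13.18s).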
Version Details
- Version ID: aa25ee1296b5c036b003ef80d32c83983c522e8c7d6f108460bbb0af97ebe93a
- Version Created: May 6, 2025