voice-cloning AI Models - Page 5

cuuupid/zonos

Generate speech from text with optional voice cloning from a short reference audio. Accept text plus a 5–30s speaker sam...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 267 runs

lucataco/whisperspeech-small

Convert text to speech with optional zero-shot voice cloning. Accept a text prompt and an optional speaker reference aud...

📝 → 🔊 • text-to-speech • voice-cloning • 1.6K runs

suno-ai/bark

Generate speech, music, background noise, and simple sound effects from a text prompt. Output an audio file, with an opt...

📝 → 🔊 • text-to-speech • music-generation • sound-effect-generation • 302.0K runs

kimanh2023/xvdsdfstevbrsgwesfgesaraw

Generate speech in a cloned voice from text input. Provide a reference audio clip and its transcript to capture the targ...

📝 → 🔊 • voice-cloning • text-to-speech • 24 runs

suminhthanh/vixtts

Clone a speaker’s voice from a 6-second sample and synthesize speech from text in Vietnamese and 17 other languages. Acc...

📝 → 🔊 • text-to-speech • voice-cloning • 512 runs

ahm3texe/test999

Converts input audio using RVC (Retrieval-based Voice Conversion) technology to transform vocals into different voice st...

🔊 • voice-cloning • audio-to-audio • 11 runs

kjjk10/llasa-3b-long

Clone a voice from a short reference sample and synthesize speech from text. Provide a voice sample and target text to g...

📝 → 🔊 • text-to-speech • voice-cloning • 1.5K runs

adirik/hierspeechpp

Clone a target voice and synthesize speech from text or convert reference speech to the target voice (zero-shot). Provid...

📝 → 🔊 • text-to-speech • voice-cloning • audio-to-audio • 4.6K runs

tuannha/f5-tts-vi

Generate Vietnamese speech from text with zero-shot voice cloning from a reference audio sample. Accepts input text, a r...

📝 → 🔊 • text-to-speech • voice-cloning • vietnamese • 84 runs

minimax/music-01

Generate up to 60 seconds of music with vocals from lyrics and a reference track. Condition on a reference song to learn...

music-generation • singing-voice-generation • voice-cloning • 415.7K runs

subformer/video-dubbing

Dub videos into 100+ languages with cloned voices. Takes a video input and returns a dubbed video with translated speech...

🎥 • video-to-video • speech-translation • voice-cloning • 38 runs

replicate/train-rvc-model

Train a custom RVC (Retrieval-based Voice Conversion) voice-conversion model from an audio dataset. Input a dataset zip of segmen...

voice-cloning • voice-model-training • 349.0K runs

resemble-ai/chatterbox-turbo

Convert text to speech with low latency for voice agents, narration, and interactive applications. Accepts text (up to 5...

📝 → 🔊 • text-to-speech • voice-cloning • 15 runs

qwen/qwen3-tts

Generate multilingual speech from text with preset voices, voice cloning, and voice design. Accept text plus optional la...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 216.6K runs

qwen/qwen-tts

Generate speech from text with preset, cloned, or designed voices. Accept text as input and return spoken audio. Choose...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 9 runs

voiser-ai/moss-tts

Convert text to speech with optional voice cloning from a reference audio sample. Accepts text and an optional speaker r...

📝 → 🔊 • text-to-speech • voice-cloning • 15 runs

inworld/tts-1.5-max

Converts text to natural, expressive speech with latency under 200 ms. Supports 15 languages including English, Chine...

📝 → 🔊 • text-to-speech • voice-cloning • 79.5K runs

inworld/tts-1.5-mini

Converts text to speech with ultra-fast ~120ms latency and support for 15 languages including English, Chinese, Japanese...

📝 → 🔊 • text-to-speech • voice-cloning • 30.2K runs

minimax/speech-2.8-hd

Convert text to natural-sounding speech. Generate high-fidelity audio from up to 10,000 characters with 17+ preset voice...

📝 → 🔊 • text-to-speech • voice-cloning • 2 runs

ttsds/maskgct

Convert text to speech with zero-shot voice cloning from a reference audio sample. Provide target text and language (Eng...

📝 → 🔊 • text-to-speech • voice-cloning • 483 runs