voice-cloning AI Models - Page 2

thomcle/chatterbox-tts

Convert text to speech with optional zero-shot voice cloning from a short reference audio clip. Accepts text and an opti...

📝 → 🔊 • text-to-speech • voice-cloning • 1.0K runs

🤖 Model 🔊

jagilley/free-vc

Convert spoken audio to a target speaker’s voice using a reference sample. Provide a source audio file (content) and a r...

🔊 • audio-to-audio • voice-cloning • 76.0K runs

🤖 Model 📝 → 🔊

chenxwh/openvoice

Clone a voice from a short reference clip and generate speech from text. Accepts text and a reference audio sample; outp...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 77.8K runs

🤖 Model 📝 → 🔊

cjwbw/voicecraft

Generate speech from text using a short voice reference, or edit existing speech with insertion, substitution, and delet...

📝 → 🔊 • text-to-speech • audio-to-audio • voice-cloning • 10.6K runs

🤖 Model 📝 → 🔊

x-lance/f5-tts

Synthesize speech from text in a cloned voice using a reference audio sample. Provide a text prompt and speaker referenc...

📝 → 🔊 • text-to-speech • voice-cloning • 32.1K runs

🤖 Model 📝 → 🔊

resemble-ai/chatterbox-multilingual

Generate expressive multilingual speech from text. Accept a text prompt and a language selection (ar, da, de, el, en, es...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual-tts • 2.7K runs

🤖 Model 📝 → 🔊

platform-kit/mars5-tts

Generate English speech from text with zero-shot voice cloning from a 5–10s reference clip. Provide text, a short refere...

📝 → 🔊 • text-to-speech • voice-cloning • 517 runs

🤖 Model 📝 → 🔊

afiaka87/tortoise-tts

Generate speech from text with optional voice cloning from a reference voice sample. Accepts a text prompt and optional...

📝 → 🔊 • text-to-speech • voice-cloning • 172.5K runs

🤖 Model 📝 → 🔊

adirik/styletts2

Convert text to expressive speech, with optional speaker style cloning from a short reference audio. Accepts text input...

📝 → 🔊 • text-to-speech • voice-cloning • 131.8K runs

🤖 Model 🔊

zsxkib/realistic-voice-cloning

Create AI song covers by converting vocals in an input audio track to a target voice with RVC v2. Provide an audio file...

🔊 • voice-cloning • audio-to-audio • 1.2M runs

🤖 Model 📝 → 🔊

fermatresearch/spanish-f5-tts

Generate Spanish speech from text by cloning the voice from a reference audio. Provide Spanish text, a reference audio s...

📝 → 🔊 • text-to-speech • voice-cloning • spanish • 918 runs

🤖 Model 📝 → 🔊

ttsds/voicecraft

Generate speech from text in a cloned speaker’s voice. Provide a text prompt, a reference audio sample of the target voi...

📝 → 🔊 • text-to-speech • voice-cloning • 512 runs

🤖 Model 📝 → 🔊

usamaehsan/voices

Generate speech from text and convert voices. Use zero_shot voice cloning to synthesize speech in the style of a prompt_...

📝 → 🔊 • text-to-speech • voice-cloning • audio-to-audio • 877 runs

🤖 Model 📝 → 🔊

jichengdu/cosyvoice

Clone a speaker's voice and synthesize speech from text, including cross-lingual and mixed-lingual output. Accepts refer...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 1.7K runs

🤖 Model 📝 → 🔊

chenxwh/cosyvoice2-0.5b

Generate multilingual speech from text with zero-shot voice cloning. Provide a short reference audio clip and its transc...

📝 → 🔊 • text-to-speech • voice-cloning • 6.3K runs

🤖 Model 📝 → 🔊

ttsds/hierspeechpp_1_1

Generate speech from text conditioned on a reference voice sample. Input text and a speaker reference audio clip, and ou...

📝 → 🔊 • text-to-speech • voice-cloning • 258 runs

🤖 Model 📝 → 🔊

jichengdu/spark-tts

Generate spoken audio from text. Clone a target voice by providing a prompt audio sample (voice_cloning mode), or synthe...

📝 → 🔊 • text-to-speech • voice-cloning • 247 runs

🤖 Model 📝 → 🔊

ttsds/fishspeech_1_1

Generate speech audio from text with voice cloning from a reference speaker clip. Accepts target text to speak, a speake...

📝 → 🔊 • text-to-speech • voice-cloning • 192 runs

🤖 Model 📝 → 🔊

codehappynice/voicegenerator

Clone a voice from a short reference audio and synthesize speech from text. Provide at least 6 seconds of speaker audio...

📝 → 🔊 • text-to-speech • voice-cloning • 39 runs

🤖 Model 🔊

pseudoram/rvc-v2

Convert speech to a target voice using RVC v2 voice models. Takes an input speech audio clip and outputs converted audio...

🔊 • audio-to-audio • voice-cloning • speech-style-transfer • 1.1M runs