voice-cloning AI Models - Cloudernative

zsxkib/dia

Generate multi-speaker dialogue audio from text. Specify speakers with [S1], [S2], etc., and include non-verbal cues in...

📝 → 🔊 • text-to-speech • voice-cloning • multi-speaker-tts • 9.3K runs

🤖 Model 📝 → 🔊

lucataco/neutts-air

Clone a voice from a short reference sample and synthesize new speech from text. Accepts text to speak, a 3–15s mono ref...

📝 → 🔊 • text-to-speech • voice-cloning • 168 runs

🤖 Model 📝 → 🔊

acappemin/deepaudio-v1

Generate speech and soundtracks from a video input. Condition speech on a provided transcript (text) and optionally a re...

📝 → 🔊 • video-to-audio • text-to-speech • 63 runs

🤖 Model 📝 → 🔊

ttsds/pheme

Generate speech from text in the voice of a reference speaker. Takes a text prompt, a speaker reference audio clip, and...

📝 → 🔊 • text-to-speech • voice-cloning • 695 runs

🤖 Model 📝 → 🔊

ttsds/xtts_1

Generate speech from text using a cloned voice from a reference audio sample. Accept text and a speaker reference, then...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 714 runs

🤖 Model 📝 → 🔊

ttsds/e2

Generate speech audio from text, cloning the voice from a provided reference recording. Provide the text to speak, a spe...

📝 → 🔊 • text-to-speech • voice-cloning • 270 runs

🤖 Model 📝 → 🔊

ttsds/f5

Generate speech from text in a cloned voice using a reference audio sample and its transcript. Accepts text plus speaker...

📝 → 🔊 • text-to-speech • voice-cloning • 2.7K runs

🤖 Model 📝 → 🔊

lucataco/indextts-2

Generate expressive speech from text with zero-shot voice cloning using a reference speaker audio input. Control emotion...

📝 → 🔊 • text-to-speech • voice-cloning • emotion-control • 1.4K runs

🤖 Model 📝 → 🔊

lucataco/neutts

Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3–15 s...

📝 → 🔊 • text-to-speech • voice-cloning • 6 runs

🤖 Model 📝 → 🔊

minimax/speech-02-turbo

Generate speech audio from text with low latency for real-time applications. Choose from 300+ prebuilt voices or supply...

📝 → 🔊 • text-to-speech • multilingual-tts • real-time • 5.1M runs

🤖 Model 📝 → 🔊

minimax/speech-02-hd

Generate speech audio from text with multilingual voices, emotion control, and voice cloning. Accepts text (up to 10,000...

📝 → 🔊 • text-to-speech • voice-cloning • 971.1K runs

🤖 Model

minimax/voice-cloning

Clone a speaker’s voice from an audio recording for use with MiniMax text-to-speech models. Input an MP3/M4A/WAV clip (1...

voice-cloning • 19.2K runs

🤖 Model 📝 → 🔊

resemble-ai/chatterbox

Generate expressive speech from text. Optionally clone a target voice from a short reference audio, and control delivery...

📝 → 🔊 • text-to-speech • voice-cloning • audio-watermarking • 139.9K runs

🤖 Model 🎥

heygen/video-translate

Translate videos into other languages while preserving the speaker’s voice. Input a video and get a dubbed video output...

🎥 • video-translation • video-to-video • voice-cloning • 6.3K runs

🤖 Model 📝 → 🔊

ttsds/metavoice

Generate speech from text using a reference speaker audio to clone the speaker’s voice. Accepts a text prompt and a spea...

📝 → 🔊 • text-to-speech • voice-cloning • 646 runs

🤖 Model 📝 → 🔊

ttsds/tortoise

Generate spoken audio from text with optional voice cloning. Accepts text and an optional speaker reference audio clip t...

📝 → 🔊 • text-to-speech • voice-cloning • 1.7K runs

🤖 Model 📝 → 🔊

ttsds/openvoice_2

Generate speech from text with a cloned voice. Provide a text prompt, a target language (en, zh, es, ja, ko, fr), and a...

📝 → 🔊 • text-to-speech • voice-cloning • 786 runs

🤖 Model 📝 → 🔊

jichengdu/llasa

Convert text to speech with zero-shot voice cloning from a reference audio sample. Accepts text and a voice sample and o...

📝 → 🔊 • text-to-speech • voice-cloning • chinese-english • 89 runs

🤖 Model 📝 → 🔊

lucataco/xtts-v2

Clone a voice from a short reference clip and synthesize multilingual speech from text. Provide a text prompt and at lea...

📝 → 🔊 • text-to-speech • voice-cloning • 4.4M runs

🤖 Model 📝 → 🔊

ttsds/xtts_2

Generate speech from text while cloning a target voice from a reference audio sample. Input text and a speaker reference...

📝 → 🔊 • text-to-speech • voice-cloning • 99 runs