thomcle/chatterbox-tts
Generate speech audio from text with optional zero-shot voice cloning using a reference audio sample. Control emotional...
Found 93 models (showing 21-40)
Generate speech audio from text with optional zero-shot voice cloning using a reference audio sample. Control emotional...
Convert spoken audio to a target speakerβs voice using a reference sample. Provide a source audio file (content) and a r...
Clone a voice from a short reference clip and generate speech from text. Accepts text and a reference audio sample; outp...
Generate speech from text using a short voice reference, or edit existing speech with insertion, substitution, and delet...
Synthesize speech from text in a cloned voice using a reference audio sample. Provide a text prompt and speaker referenc...
Generate expressive multilingual speech from text. Accept a text prompt and a language selection (ar, da, de, el, en, es...
Generate English speech from text with zero-shot voice cloning from a 5β10s reference clip. Provide text, a short refere...
Generate speech from text with optional voice cloning from a reference voice sample. Accepts a text prompt and optional...
Convert text to expressive speech, with optional speaker style cloning from a short reference audio. Accepts text input...
Create AI song covers by converting vocals in an input audio track to a target voice with RVC v2. Provide an audio file...
Generate Spanish speech from text by cloning the voice from a reference audio. Provide Spanish text, a reference audio s...
Generate speech from text in a cloned speakerβs voice. Provide a text prompt, a reference audio sample of the target voi...
Generate speech from text with zero-shot and cross-lingual voice cloning, or convert one voice to another from audio inp...
Clone a speaker's voice and synthesize speech from text, including cross-lingual and mixed-lingual output. Accepts refer...
Clone a voice and synthesize speech from text. Provide a reference audio clip and its transcript plus target text to gen...
Generate speech from text conditioned on a reference voice sample. Input text and a speaker reference audio clip, and ou...
Generate spoken audio from text. Clone a target voice by providing a prompt audio sample (voice_cloning mode), or synthe...
Generate speech audio from text with voice cloning from a reference speaker clip. Accepts target text to speak, a speake...
Clone a voice from a short reference audio and synthesize speech from text. Provide at least 6 seconds of speaker audio...
Convert speech to a target voice using RVC v2 voice models. Takes an input speech audio clip and outputs converted audio...