ttsds/styletts2
Generate speech from text with optional voice cloning from a reference audio sample. Accepts a text prompt and an option...
Found 115 models (showing 101-115)
Generate speech from text with optional voice cloning from a reference audio sample. Accepts a text prompt and an option...
Generate speech from text in the voice of a reference speaker. Provide text and a speaker reference audio sample; receiv...
Synthesize speech from text using a short reference voice clip (zero-shot TTS). Clone a target speakerβs voice in Englis...
Generate speech audio from text, with optional voice cloning conditioned on a reference recording. Accepts text, an opti...
Clone a voice and synthesize speech from text in 17 languages. Provide a short reference speaker clip (at least 6 second...
Convert text to speech conditioned on a reference voice clip. Provide text, a language code (English, German, Spanish, I...
Generate multilingual speech audio from text. Optionally clone a target voice from a speaker reference audio and control...
Generate speech from text with optional voice cloning and emotion control. Accept a text prompt plus an optional 10β30s...
Generate expressive speech from text with optional zero-shot voice cloning from a short reference audio clip. Accepts te...
Convert text to speech with optional zero-shot voice cloning. Accept a text prompt and an optional speaker reference aud...
Generate speech audio from a text input. Choose from multiple preset voices (af_heart, af_bella, af_alloy, af_aoede, af_...
Generate Hololive VTuber-style speech from text or convert a reference audio clip into those voices. Takes text input or...
Generate speech in a cloned voice from text. Provide a short reference audio sample and its transcript (ref_text) to cap...
Clone a speakerβs voice from a 6+ second audio sample and synthesize spoken audio from text in multiple languages. Provi...
Generate Chinese and English speech audio from text. Accepts text and a voice_type and outputs narrated audio. Select vo...