dessix/moss-ttsd
Generate two-speaker conversational speech from a dialogue script. Accepts text formatted with [S1]/[S2] turns and optio...
Found 93 models (showing 61-80)
Generate two-speaker conversational speech from a dialogue script. Accepts text formatted with [S1]/[S2] turns and optio...
Convert text to speech with optional voice cloning from a reference audio sample. Accepts text and an optional speaker r...
Convert text into speech while cloning a target voice from a reference audio sample. Accept a text prompt to read, a spe...
Clone a voice and synthesize English speech from text using a short reference audio clip. Accepts text plus a 2β10 secon...
Clone voices and synthesize speech from text using a short reference audio clip (10β30 seconds) and its transcript. Acce...
Translate speech between English, French, Spanish, German, Italian, and Chinese while preserving the speakerβs style, pr...
Generate speech from text using a cloned voice taken from a reference audio sample. Takes target text, a speaker referen...
Convert text to speech in a target speakerβs voice using a short reference audio clip. Provide text and a speaker refere...
Create an RVC v2 voice cloning dataset from a YouTube video. Provide a YouTube URL and optionally a dataset name; it dow...
Convert a singing voice in an input audio to sound like a selected target singer. Transform source vocals to the timbre...
Clone a speakerβs voice from a reference audio and synthesize speech from text. Accepts a text prompt and a reference au...
Generate speech audio from text while cloning a target voice from a reference audio sample. Provide the text to speak, a...
Generate speech from text with optional voice cloning from a reference audio sample. Accepts text plus an optional speak...
Generate speech from text in the voice of a reference speaker. Provide text and a speaker reference audio sample; receiv...
Synthesize speech from text using a short reference voice clip (zero-shot TTS). Clone a target speakerβs voice in Englis...
Generate speech audio from text, with optional voice cloning conditioned on a reference recording. Accepts text, an opti...
Clone a voice and synthesize speech from text in 17 languages. Provide a short reference speaker clip (at least 6 second...
Generate speech from text using a reference voice sample. Provide text, a language code (en, de, es, it, ja, pl, pt, tr)...
Generate speech audio from text with optional voice cloning from a reference audio clip. Accept a speaker reference audi...
Generate expressive speech from text with optional voice cloning from a short reference clip. Accept a text prompt and a...