
zsxkib/dia
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Found 86 models (showing 1-20)
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Clone a speakerβs voice from a short reference and synthesize new speech from text. Accepts a text prompt, a 3β15 second...
Generate speech and sound effects for a video from text prompts and an optional reference voice. Accepts a video input p...
Generate speech from text using a cloned voice from a reference audio sample. Takes three inputsβtext to speak, a speake...
Generate speech from text using a reference voice sample. Accepts a text prompt and a speaker reference audio (with an o...
Convert text to speech using a reference voice clip. Input a target text, a speaker reference audio sample, and a refere...
Generate speech from text using a reference speaker audio and its transcript for zero-shot voice cloning. Provide target...
Generate expressive speech from text using a reference voice sample (zero-shot voice cloning). Provide text and a short...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3β15 s...
Convert text into spoken audio for low-latency, real-time use. Choose from 300+ prebuilt voices or use a cloned voice, w...
Generate speech from text for highβfidelity voiceovers, audiobooks, and narration. Accepts text plus optional voice_id,...
Clone a custom voice from an input audio clip for use with MiniMax speech-02-hd and speech-02-turbo text-to-speech. Prov...
Generate expressive speech from text with optional instant voice cloning. Perform zero-shot TTS by providing a short ref...
Translate videos to a target language while preserving the original speakerβs voice, returning a dubbed video. Input a v...
Generate speech from text in the voice of a reference speaker. Accepts a text prompt and a speaker reference audio clip,...
Generate speech audio from text. Optionally clone a target speaker by supplying a short reference audio clip to synthesi...
Generate speech from text using a reference voice sample. Takes text, a target language (English, Chinese, Spanish, Japa...
Convert text to speech with zero-shot voice cloning from a short reference audio. Accepts a text prompt and a voice samp...
Clone a voice from a short audio sample and generate multilingual speech from text. Accepts a text prompt and a referenc...
Generate speech from text using a reference audio sample to clone the speakerβs voice. Provide text, select a language,...