
zsxkib/dia
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Found 82 models (showing 1-20)
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Clone a speakerβs voice from a short reference and synthesize new speech from text. Accepts a text prompt, a 3β15 second...
Generate speech and sound from a video input. Accept a video and optionally a text transcript to drive video-to-speech (...
Generate speech from text using a cloned voice from a reference audio sample. Provide a text prompt, a reference speaker...
Generate speech from text in a cloned voice using a short reference audio sample. Perform crossβlingual voice cloning to...
Generate speech from text using a reference voice sample. Provide target text, a speaker reference audio clip, and the r...
Generate speech audio from a text input using a reference voice sample. Provide the text to speak, a speaker_reference a...
Generate expressive speech from text with zero-shot voice cloning from a reference audio sample. Accepts text plus speak...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3β15 s...
Convert text into spoken audio for low-latency, real-time use. Choose from 300+ prebuilt voices or use a cloned voice, w...
Generate speech from text for highβfidelity voiceovers, audiobooks, and narration. Accepts text plus optional voice_id,...
Clone a custom voice from an input audio clip for use with MiniMax speech-02-hd and speech-02-turbo text-to-speech. Prov...
Generate expressive speech from text with optional instant voice cloning. Perform zero-shot TTS by providing a short ref...
Translate videos to a target language while preserving the original speakerβs voice, returning a dubbed video. Input a v...
Generate speech from text in a target speakerβs voice using a reference audio sample. Accepts a text prompt and a speake...
Convert text into speech, optionally cloning the voice from a reference audio sample. Accepts text and an optional speak...
Generate speech from text in a specific speakerβs voice using a reference audio sample. Provide text, choose a language...
Convert text to speech with zero-shot voice cloning from a short reference audio. Accepts a text prompt and a voice samp...
Clone a voice from a short audio sample and generate multilingual speech from text. Accepts a text prompt and a referenc...
Generate speech from text using a reference audio sample to clone the speakerβs voice. Provide text, select a language,...