zsxkib/dia
Generate multi-speaker dialogue audio from text. Specify speakers with [S1], [S2], etc., and include non-verbal cues in...
Found 93 models (showing 1-20)
Generate multi-speaker dialogue audio from text. Specify speakers with [S1], [S2], etc., and include non-verbal cues in...
Clone a voice from a short reference sample and synthesize new speech from text. Accepts text to speak, a 3–15s mono ref...
Generate speech and soundtracks from a video input. Condition speech on a provided transcript (text) and optionally a re...
Generate speech from text in the voice of a reference speaker. Takes a text prompt, a speaker reference audio clip, and...
Generate speech from text using a cloned voice from a reference audio sample. Accept text and a speaker reference, then...
Generate speech audio from text, cloning the voice from a provided reference recording. Provide the text to speak, a spe...
Generate speech from text in a cloned voice using a reference audio sample and its transcript. Accepts text plus speaker...
Generate expressive speech from text with zero-shot voice cloning using a reference speaker audio input. Control emotion...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3–15 s...
Generate speech audio from text with low latency for real-time applications. Choose from 300+ prebuilt voices or supply...
Generate speech audio from text with multilingual voices, emotion control, and voice cloning. Accepts text (up to 10,000...
Clone a speaker's voice from an audio recording for use with MiniMax text-to-speech models. Input an MP3/M4A/WAV clip (1...
Generate expressive speech from text. Optionally clone a target voice from a short reference audio, and control delivery...
Translate videos into 150+ languages while preserving the speaker's voice, returning a dubbed video output. Input a vide...
Generate speech from text using a reference speaker audio to clone the speaker's voice. Accepts a text prompt and a spea...
Generate spoken audio from text with optional voice cloning. Accepts text and an optional speaker reference audio clip t...
Generate speech from text with a cloned voice. Provide a text prompt, a target language (en, zh, es, ja, ko, fr), and a...
Convert text to speech with zero-shot voice cloning from a reference audio sample. Accepts text and a voice sample and o...
Clone a voice from a short reference clip and synthesize multilingual speech from text. Provide a text prompt and at lea...
Generate speech from text while cloning a target voice from a reference audio sample. Input text and a speaker reference...