minimax/speech-02-hd
Generate speech audio from text with multilingual voices, emotion control, and voice cloning. Accepts text (up to 10,000...
Found 131 models (showing 21-40)
Generate speech audio from text with multilingual voices, emotion control, and voice cloning. Accepts text (up to 10,000...
Translate speech and text across 100+ languages, returning text and optionally synthesized speech. Accept audio or text...
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Generate expressive speech from text. Optionally clone a target voice from a short reference audio, and control delivery...
Generate speech from text using a reference speaker audio to clone the speakerβs voice. Accepts a text prompt and a spea...
Generate spoken audio from text with optional voice cloning. Accepts text and an optional speaker reference audio clip t...
Generate speech from text with a cloned voice. Provide a text prompt, a target language (en, zh, es, ja, ko, fr), and a...
Convert text to speech with zero-shot voice cloning from a reference audio sample. Accepts text and a voice sample and o...
Generate conversational speech from text input. Convert text to spoken audio with Sesameβs CSM 1B; choose between two sp...
Clone a voice from a short reference clip and synthesize multilingual speech from text. Provide a text prompt and at lea...
Generate speech from text while cloning a target voice from a reference audio sample. Input text and a speaker reference...
Generate conversational speech for phone-call applications from text input. Select from multiple preset voices (male_voi...
Generate speech audio from text input with low latency. Select from 700+ multilingual voices and accents, or use a voice...
Generate expressive speech from text. Accepts a text prompt and returns spoken audio with controllable emotion and proso...
Generate speech audio from text with optional zero-shot voice cloning using a reference audio sample. Control emotional...
Generate speech and other audio from a text prompt. Produce multilingual speech with automatic language detection and co...
Generate speech, music, background noise, and simple sound effects from a text prompt. Output an audio file, with an opt...
Clone a voice from a short reference clip and generate speech from text. Accepts text and a reference audio sample; outp...
Generate speech audio from text with selectable multilingual voices. Accepts text input, a preset voice, and a speed mul...
Generate expressive speech from text. Accepts text input and outputs spoken audio, with preset voices (tara, dan, josh,...