
minimax/speech-02-hd
Generate speech from text for highβfidelity voiceovers, audiobooks, and narration. Accepts text plus optional voice_id,...
Found 107 models (showing 21-40)
Generate speech from text for highβfidelity voiceovers, audiobooks, and narration. Accepts text plus optional voice_id,...
Translate speech and text across languages, returning translated text and optionally synthesized speech audio. Supports...
Transcribe audio and generate spoken or textual responses from an audio input. Accepts an audio clip and optional text p...
Generate expressive speech from text with optional instant voice cloning. Perform zero-shot TTS by providing a short ref...
Generate speech in a reference speakerβs voice from text input. Takes a text prompt and a speaker reference audio sample...
Convert text into speech, optionally cloning the voice from a reference audio sample. Accepts text and an optional speak...
Generate speech audio from text using a reference voice sample. Accepts text, a target language (en, zh, es, ja, ko, fr)...
Convert text to speech with zero-shot voice cloning from a short reference audio. Accepts a text prompt and a voice samp...
Generate speech audio from text input. Accept a text prompt and a speaker ID (0 or 1) and return a spoken waveform, with...
Clone a voice from a short audio sample and generate multilingual speech from text. Accepts a text prompt and a referenc...
Generate speech from text in a reference speakerβs voice. Accepts text and a speaker reference audio clip plus a languag...
Generate conversational speech from text for phone-call and IVR applications. Accepts a text prompt and a selectable voi...
Generate speech audio from text. Select from 700+ multilingual voices by accent or use a previously cloned voice via voi...
Generate expressive speech from text input. Choose from built-in voices (Luna, Ember, Hem, Aurora, Cliff, Josh, William...
Generate speech from text with optional zero-shot voice cloning from a reference audio sample. Provide text and an optio...
Generate speech and other audio from a text prompt. Produce multilingual speech with automatic language detection and co...
Generate multilingual speech and other audio from a text prompt. Produce spoken dialogue, background noise, simple sound...
Clone a voice from a reference audio and synthesize speech from text. Provide a short audio sample to capture timbre, th...
Convert text to speech audio with adjustable speed and a wide selection of preset voices. Accepts text input (long passa...
Generate expressive speech from text. Accepts a text prompt and returns spoken audio with controllable emotion and nonve...