elevenlabs/v2-multilingual
Generate speech audio from text in 30+ languages. Accepts a text prompt and language code, with selectable preset voices...
Found 131 models (showing 121-131)
Generate speech audio from text in 30+ languages. Accepts a text prompt and language code, with selectable preset voices...
Generate expressive speech audio from text input. Choose from preset voices (e.g., Rachel, Drew, Paul, Aria, Domi, Dave,...
Generate multilingual speech audio from text input. Control voice selection (300+ system voices or a custom cloned voice...
Generate speech audio from text with low latency for real-time agents and interactive apps. Accept text input and output...
Hold multi-turn, multimodal conversations grounded in images, audio, video, and text, returning answers as text and opti...
Convert text to speech audio with low latency for real-time voice agents, chatbots, and interactive apps. Generate multi...
Generate speech audio from text input. Produce fast text-to-speech with selectable preset voices (e.g., Rachel, Drew, Ar...
Generate short podcast clips with a talking AI host from a text prompt. Provide a podcaster_prompt, choose a voice (Wise...
Convert text to speech with low latency for voice agents, narration, and interactive applications. Accepts text (up to 5...
Generate speech and soundtracks from a video input. Condition speech on a provided transcript (text) and optionally a re...
Train and fine-tune a GPT-SoVITS voice model for voice cloning and text-to-speech. Input an audio or video dataset to ex...