lucataco/higgs-audio-v2
Generate expressive, multilingual speech audio from text input. Produce zero-shot multi-speaker dialogues, emotional del...
Found 141 models (showing 1-20)
Generate expressive, multilingual speech audio from text input. Produce zero-shot multi-speaker dialogues, emotional del...
Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for clip duration and t...
Generate multi-speaker dialogue audio from text. Specify speakers with [S1], [S2], etc., and include non-verbal cues in...
Convert text to speech using MetaVoice-1B, a 1.2 billion parameter audio model trained on 100,000 hours of speech. Input...
Clone a voice from a short reference sample and synthesize new speech from text. Accepts text to speak, a 3–15s mono ref...
Generate speech audio from text. Select from preset voices and control speaking speed; automatically split long inputs f...
Generate imaginative text responses from a prompt and convert them to audio narration with customizable voice options. S...
Generate speech from text using natural-language voice descriptions. Provide a text prompt and a free-form voice descrip...
Generate speech from text in the voice of a reference speaker. Takes a text prompt, a speaker reference audio clip, and...
Generate speech from text using a cloned voice from a reference audio sample. Accept text and a speaker reference, then...
Generate speech audio from text, cloning the voice from a provided reference recording. Provide the text to speak, a spe...
Generate speech from text in a cloned voice using a reference audio sample and its transcript. Accepts text plus speaker...
Generate expressive speech from text with zero-shot voice cloning using a reference speaker audio input. Control emotion...
Generate speech audio from text with natural-language control over voice and acoustics. Provide a script and an optional...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3–15 s...
Convert text to speech on CPU with multiple built-in voices. Accepts text and outputs spoken audio, with controls for vo...
Generate speech audio from text with selectable preset voices. Provide a text prompt and choose a voice (af, af_bella, a...
Answer spoken queries with simultaneous text and speech output. Accepts a speech audio input and an optional instruction...
Process text, images, audio, and video inputs to generate text and speech responses simultaneously. Features a novel Thi...
Generate speech audio from text with low latency for real-time applications. Choose from 300+ prebuilt voices or supply...