lucataco/higgs-audio-v2
Generate expressive, multilingual speech audio from text input. Produce zero-shot multi-speaker dialogues, emotional del...
Found 131 models (showing 1-20)
Generate expressive, multilingual speech audio from text input. Produce zero-shot multi-speaker dialogues, emotional del...
Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for clip duration and t...
Generate multi-speaker dialogue audio from text. Specify speakers with [S1], [S2], etc., and include non-verbal cues in...
Convert text to speech using MetaVoice-1B, a 1.2 billion parameter audio model trained on 100,000 hours of speech. Input...
Clone a voice from a short reference sample and synthesize new speech from text. Accepts text to speak, a 3–15s mono ref...
Generate speech audio from text. Select from preset voices and control speaking speed; automatically split long inputs f...
Generate imaginative text responses from a prompt and convert them to audio narration with customizable voice options. S...
Generate speech from text using natural-language voice descriptions. Provide a text prompt and a free-form voice descrip...
Generate speech from text in the voice of a reference speaker. Takes a text prompt, a speaker reference audio clip, and...
Generate speech from text using a cloned voice from a reference audio sample. Accept text and a speaker reference, then...
Generate speech audio from text, cloning the voice from a provided reference recording. Provide the text to speak, a spe...
Generate speech from text in a cloned voice using a reference audio sample and its transcript. Accepts text plus speaker...
Generate expressive speech from text with zero-shot voice cloning using a reference speaker audio input. Control emotion...
Generate speech audio from text with natural-language control over voice and acoustics. Provide a script and an optional...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3–15 s...
Convert text to speech on CPU with multiple built-in voices. Accepts text and outputs spoken audio, with controls for vo...
Generate speech audio from text with selectable preset voices. Provide a text prompt and choose a voice (af, af_bella, a...
Answer spoken queries with simultaneous text and speech output. Accepts a speech audio input and an optional instruction...
Chat and reason across text, images, audio, and video, outputting text and synthesized speech. Accept text prompts with...
Generate speech audio from text with low latency for real-time applications. Choose from 300+ prebuilt voices or supply...