
lucataco/higgs-audio-v2
Generate expressive speech audio from text input. Control prosody, emotion, and acoustic context with a scene descriptio...
Found 110 models (showing 1-20)
Generate expressive speech audio from text input. Control prosody, emotion, and acoustic context with a scene descriptio...
Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for duration (2.5β20s),...
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Convert text to speech using MetaVoice-1B, a 1.2 billion parameter audio model trained on 100,000 hours of speech. Input...
Clone a speakerβs voice from a short reference and synthesize new speech from text. Accepts a text prompt, a 3β15 second...
Convert English text into speech with selectable preset voices and adjustable speaking speed. Accepts text plus optional...
Generate imaginative text responses from a prompt and convert them to audio narration with customizable voice options. S...
Generate speech from text with natural-language control over voice style. Provide the words to speak and a separate voic...
Generate speech from text using a cloned voice from a reference audio sample. Provide a text prompt, a reference speaker...
Generate multilingual speech from text using a reference speakerβs voice. Provide text and a speaker reference audio sam...
Clone a speakerβs voice and synthesize speech from text. Provide the text to speak, a speaker_reference audio sample, an...
Generate speech audio from a text input using a reference voice sample. Provide the text to speak, a speaker_reference a...
Generate expressive speech from text with zero-shot voice cloning from a reference audio sample. Accepts text plus speak...
Generate speech audio from text with natural-language control over voice and recording style. Provide a script (prompt)...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3β15 s...
Convert text to spoken audio with multiple built-in voices. Accepts text with optional exact voice name or male/female/a...
Convert text to spoken audio. Accepts a text prompt and a selectable voice (af_bella, af_sarah, am_adam, am_michael, bf_...
Answer spoken questions with both text and synthesized speech. Accept a speech audio clip and an optional instruction pr...
Chat with a multimodal assistant that understands text, images, audio, and video inputs and returns text plus optional s...
Convert text into spoken audio for low-latency, real-time use. Choose from 300+ prebuilt voices or use a cloned voice, w...