
lucataco/higgs-audio-v2
Generate expressive speech audio from text input. Control prosody, emotion, and acoustic context with a scene descriptio...
Found 107 models (showing 1-20)
Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for duration (2.5–20s),...
Generate multi-speaker dialogue audio from text. Use [S1], [S2] speaker tags and parentheses for non-verbal cues (laughs...
Convert text to speech using MetaVoice-1B, a 1.2 billion parameter audio model trained on 100,000 hours of speech. Input...
Clone a speaker's voice from a short reference and synthesize new speech from text. Accepts a text prompt, a 3–15 second...
Convert English text into speech with selectable preset voices and adjustable speaking speed. Accepts text plus optional...
Generate imaginative text responses from a prompt and convert them to audio narration with customizable voice options. S...
Generate speech from text with natural-language control over voice, emotion, pacing, pitch, and background ambience. Acc...
Convert text to speech in the voice of a reference speaker. Takes a text prompt, a speaker reference audio sample, and a...
Generate speech from text in a cloned voice using a reference audio sample. Accepts text plus a speaker reference and ou...
Generate speech from text using a reference voice sample. Provide target text, a speaker reference audio clip, and the r...
Generate speech audio from text using a reference voice sample. Condition on a short speaker_reference recording and its...
Generate expressive speech from text with zero-shot voice cloning using a reference speaker audio sample. Accepts text a...
Generate speech from text with natural-language control of voice and recording style. Accepts a text prompt and a descri...
Generate speech audio from text with instant voice cloning from a short reference clip. Provide a text prompt and 3–15 s...
Convert text to spoken audio with multiple built-in voices. Accepts text with optional exact voice name or male/female/a...
Convert text to spoken audio with selectable preset voices. Accepts a text prompt and a voice choice (af, af_bella, af_s...
Answer spoken questions and return both text and spoken responses. Accepts input speech audio (with an optional instruct...
Chat and analyze across text, images, audio, and video, returning text responses and optional synthesized speech. Accept...
Convert text into spoken audio for low-latency, real-time use. Choose from 300+ prebuilt voices or use a cloned voice, w...