cjwbw/melotts
Generate speech from text across multiple languages. Accepts text input with language selection (EN, ES, FR, ZH, JP, KR)...
Generate and chat in multiple languages from a text prompt. Takes a user prompt and optional system prompt and returns t...
Generate speech from text with optional voice cloning and emotion control. Accepts a text prompt plus an optional 10–30s...
Transcribe speech from audio into text and subtitles. Accepts audio input and returns plain text transcripts, SRT or VTT s...
Transcribe speech to text from audio input. Accepts an audio file and optionally a source language, and returns a transcript...
Transcribe and understand audio with Voxtral Mini 3B, an advanced model that builds upon Ministral-3B. It excels in spee...