multilingual AI Models - Page 4

cjwbw/melotts

Generate speech from text in multiple languages. Accepts text plus a language code (EN, ES, FR, ZH, JP, KR), optional En...

📝 → 🔊 • text-to-speech • multilingual • 1.4K runs

🤖 Model 📝 → 📝

qubit999/llama3.2-3b-instruct

Generate text responses from prompts using the Llama 3.2 3B instruction-tuned multilingual language model. Supports text...

📝 → 📝 • text-generation • question-answering • code-generation • 56 runs

🤖 Model 📝 → 🔊

jaaari/zonos

Generate expressive speech from text with optional voice cloning from a short reference clip. Accept a text prompt and a...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual-tts • 4.9K runs

🤖 Model

soykertje/whisper

Transcribe speech from audio to text. Handle multilingual transcription with automatic language detection and optional t...

speech-to-text • multilingual • subtitle-generation • 84.7K runs

🤖 Model

nateraw/whisper-large-v3

Transcribe speech to text from audio input. Accepts an audio file and optionally a source language, returns a transcript...

speech-to-text • multilingual • 4.4K runs

🤖 Model 🔊

lucataco/voxtral-mini-3b

Transcribe and understand audio with Voxtral Mini 3B, an advanced model that builds upon Ministral-3B. It excels in spee...

🔊 • audio-transcription • audio-understanding • speech-to-text • 27 runs

🤖 Model 📝 → 🎥

wan-video/wan-2.5-t2v

Generate videos with synchronized audio from text prompts using Alibaba's WAN 2.5 model. Creates fully synchronized vide...

📝 → 🎥 • text-to-video-with-audio • lipsync • multilingual • 35.7K runs

🤖 Model 📝 → 🔊

cuuupid/zonos

Generate speech from text with optional voice cloning from a short reference audio. Accept text plus a 5–30s speaker sam...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 267 runs

🤖 Model

ibm-granite/granite-speech-3.3-8b

Transcribe and translate speech to text from audio input. Supports multilingual ASR in English, French, German, Spanish,...

speech-to-text • multilingual • 12.9K runs

🤖 Model 📝 → 📝

meta/meta-llama-3.1-405b-instruct

Generate chat and instruction-following text from prompts. Accepts a text prompt (and optional system prompt) and return...

📝 → 📝 • text-generation • text-translation • code-generation • 7.2M runs

🤖 Model 📝 → 📝

fauzi3007/sahabat-ai-replicate

Generate and chat in Indonesian, English, Sundanese, and Javanese from a text prompt. Takes text input and returns text...

📝 → 📝 • text-generation • text-translation • multilingual • 104 runs

🤖 Model 📝 → 🔊

chenxwh/openvoice

Clone a voice from a short reference clip and generate speech from text. Accepts text and a reference audio sample; outp...

📝 → 🔊 • text-to-speech • voice-cloning • multilingual • 77.8K runs

🤖 Model 📝 → 🔊

chenxwh/cosyvoice2-0.5b

Generate multilingual speech from text with zero-shot voice cloning. Provide a short reference audio clip and its transc...

📝 → 🔊 • text-to-speech • voice-cloning • 6.3K runs

🤖 Model 📝 → 🔊

playht/play-dialog

Generate natural, conversational speech and two-speaker dialogues from text. Choose from preset voices (Angelo, Arsenio,...

📝 → 🔊 • text-to-speech • multi-voice-dialogue • emotion-tts • 26.8K runs

🤖 Model

cudanexus/ocr-surya

Extract text from images and PDFs in 90+ languages. Accept an image or multi-page PDF, a selected language list, and a p...

ocr • text-detection • 6.5K runs

🤖 Model 📝 → 🖼️

bytedance/seedream-3

Generates high-resolution images up to 2K from text prompts with bilingual support for Chinese and English. Excels at cr...

📝 → 🖼️ • text-to-image • 3.4M runs

🤖 Model 📝 → 🔊

elevenlabs/v2-multilingual

Generate multilingual text-to-speech audio from text input. Convert up to ~10,000 characters per request into natural, e...

📝 → 🔊 • text-to-speech • multilingual • 1.2K runs

🤖 Model

jigsawstack/text-translate

Translate text between languages. Accepts a single string or an array of strings (up to 5,000 characters each) and retur...

text-translation • multilingual • 407 runs

🤖 Model 📝 → 🔊

elevenlabs/v3

Generate expressive speech audio from text input. Choose from preset voices (e.g., Rachel, Drew, Paul, Aria, Domi, Dave,...

📝 → 🔊 • text-to-speech • multilingual • 44 runs

🤖 Model 📝 → 📝

lucataco/qwen2-57b-a14b-instruct

Generate text responses based on prompts and conversations. This 57 billion parameter Mixture-of-Experts language model...

📝 → 📝 • text-generation • 1.4K runs