🤖 Model 📝 → 🔊

camenduru/metavoice
Convert text to speech using MetaVoice-1B, a 1.2 billion parameter audio model trained on 100,000 hours of speech. Input...
Generate a talking face video from a single image and an audio file. This model creates a video output where the face in...
Generate speech from text input using an ultra-lightweight, CPU-friendly text-to-speech model. Supports multiple built-i...
Generate psychedelic videos from animal names, transforming them into trippy train visuals with optional psychedelic aud...