cjwbw/parler-tts 📝 → 🖼️
About
lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Example Output
Prompt:
"Remember - this is only the first iteration of the model! To improve the prosody and naturalness of the speech further, we're scaling up the amount of training data by a factor of five times."
Output
Performance Metrics
16.38s
Prediction Time
145.06s
Total Time
All Input Parameters
{ "prompt": "Remember - this is only the first iteration of the model! To improve the prosody and naturalness of the speech further, we're scaling up the amount of training data by a factor of five times.", "description": "A male speaker with a low-pitched voice delivering his words at a fast pace in a small, confined space with a very clear audio and an animated tone." }
Input Parameters
- prompt
- Text for audio generation
- description
- Provide description of the output audio
Output Schema
Output
Example Execution Logs
Using the model-agnostic default `max_length` (=2580) to control the generation length. We recommend setting `max_new_tokens` to control the maximum length of the generation.
Version Details
- Version ID
bf38249a8cc143b97b5108570d1c81b8321881dd91fe7837877e7dfa3a0fad27
- Version Created
- April 15, 2024