jigsawstack/tts 📝❓ → 🖼️
About
Transform text into natural-sounding human-like AI voices with low latency and exceptional quality.

Example Output
Output
Performance Metrics
0.83s
Prediction Time
6.98s
Total Time
All Input Parameters
{ "text": "hi there, how are you ?", "accent": "en-US-female-27", "api_key": "sk_ba8d51db7829aef7ec445bb80dd59b7ff2c320d851eb6ae4cf41aacfab4df81c690774957983ebcc02f08429fccdfe42b4d8ee6a40b404399c10ca3bcabdbb72024KnIELzWIzzblw1gtM6", "return_type": "binary" }
Input Parameters
- text
- The text to generate speech from. Character Limits: - Standard TTS: 5-1,500 characters - Voice Cloning TTS: 5-500 characters
- accent
- Speaker voice accent. Not required if using voice cloning. Over 700 different voices across multiple languages are available.
- api_key
- 🔐 Your JigsawStack API Key (required)
- return_type
- The specified return type for the response
- voice_clone_id
- The unique identifier for a previously cloned voice. When provided, the API will generate speech using this pre-cloned voice profile instead of creating a new one.
Output Schema
Output
Version Details
- Version ID
d606ddfc5e1541356c2095b71d7d8d69c8188867e33ba2f69b9b53d6702c8f36
- Version Created
- June 24, 2025