declare-lab/tangoflux 🔢📝 → 🖼️
About
Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Example Output
Prompt:
"The deep growl of an alligator ripples through the swamp as reeds sway with a soft rustle and a turtle splashes into the murky water"
Output
Performance Metrics
1.39s
Prediction Time
1.39s
Total Time
All Input Parameters
{
"steps": 25,
"prompt": "The deep growl of an alligator ripples through the swamp as reeds sway with a soft rustle and a turtle splashes into the murky water",
"duration": 10,
"guidance_scale": 4.5
}
Input Parameters
- steps
- Number of inference steps
- prompt
- Input prompt
- duration
- Duration of the output audio in seconds
- guidance_scale
- Scale for classifier-free guidance
Output Schema
Output
Example Execution Logs
0%| | 0/25 [00:00<?, ?it/s] 0%| | 0/25 [00:01<?, ?it/s]
Version Details
- Version ID
fcdc421786888a045329d7c4e1874764433a2516b21f4c34bd3da4e054d04cf9- Version Created
- December 31, 2024