sepal/audiogen
About
Generate sounds from a text prompt

Example Output
Prompt:
"Formula f1 cars driving by"
Performance Metrics
- Prediction Time: 13.60s
- Total Time: 452.06s
All Input Parameters
{ "top_k": 250, "top_p": 0, "prompt": "Formula f1 cars driving by", "duration": 5, "temperature": 1, "output_format": "mp3", "classifier_free_guidance": 3 }
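The parameter set above can be assembled and submitted programmatically. The following is a minimal sketch, not the model author's code: it assumes the model is served through Replicate and called with the third-party `replicate` Python client, which needs a `REPLICATE_API_TOKEN` environment variable. The helper names `build_input` and `generate` are hypothetical, and the keyword defaults simply mirror the example values shown above rather than confirmed model defaults.

```python
import json

# Model reference built from the Version ID listed under Version Details.
MODEL_REF = (
    "sepal/audiogen:"
    "154b3e5141493cb1b8cec976d9aa90f2b691137e39ad906d2421b74c2a8c52b8"
)


def build_input(prompt, duration=5, top_k=250, top_p=0, temperature=1,
                output_format="mp3", classifier_free_guidance=3):
    """Assemble the input payload (hypothetical helper).

    Defaults copy the example on this page; they are not verified
    against the model's actual defaults.
    """
    if not prompt:
        raise ValueError("prompt is required")
    return {
        "top_k": top_k,
        "top_p": top_p,
        "prompt": prompt,
        "duration": duration,
        "temperature": temperature,
        "output_format": output_format,
        "classifier_free_guidance": classifier_free_guidance,
    }


def generate(prompt, **overrides):
    """Submit a prediction (assumes the `replicate` package is installed)."""
    import replicate  # imported lazily so build_input works without it

    return replicate.run(MODEL_REF, input=build_input(prompt, **overrides))


if __name__ == "__main__":
    # Print the payload without contacting the service.
    print(json.dumps(build_input("Formula f1 cars driving by"), indent=2))
```

Calling `generate("Formula f1 cars driving by", duration=5)` would send the same payload as the example; the shape of the returned value depends on the client version, so it is not shown here.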
Input Parameters
- top_k: Restricts sampling to the k most likely tokens.
- top_p: Restricts sampling to the most likely tokens whose cumulative probability reaches p. When set to `0` (default), top_k sampling is used instead.
- prompt (required): Text prompt that describes the sound.
- duration: Maximum duration of the generated sound.
- temperature: Controls the 'conservativeness' of the sampling process. A higher temperature produces more diverse output.
- output_format: Output format for the generated audio.
- classifier_free_guidance: Increases the influence of the inputs on the output. Higher values produce lower-variance outputs that adhere more closely to the inputs.
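The sampling parameters above interact in a way that is easier to see in code. The sketch below illustrates the standard textbook forms of temperature scaling, top-k filtering, top-p (nucleus) filtering, and classifier-free guidance on a toy list of logits. It is an assumption about how such parameters commonly work, not the model's actual implementation; all function names are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn logits into probabilities; higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def _renormalize(probs, keep):
    """Zero every token outside `keep` and rescale the rest to sum to 1."""
    kept = [p if i in keep else 0.0 for i, p in enumerate(probs)]
    total = sum(kept)
    return [p / total for p in kept]


def filter_top_k(probs, k):
    """Keep only the k most likely tokens."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return _renormalize(probs, set(order[:k]))


def filter_top_p(probs, p):
    """Keep the smallest set of top tokens whose cumulative mass reaches p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, cumulative = set(), 0.0
    for i in order:
        keep.add(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    return _renormalize(probs, keep)


def apply_guidance(cond_logits, uncond_logits, scale):
    """Common classifier-free guidance form: push logits toward the
    prompt-conditioned prediction by `scale` (assumed, not confirmed)."""
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


def candidate_probs(logits, top_k=250, top_p=0.0, temperature=1.0):
    """Mirror the documented rule: top_p == 0 falls back to top_k."""
    probs = softmax(logits, temperature)
    if top_p > 0:
        return filter_top_p(probs, top_p)
    return filter_top_k(probs, top_k)
```

With `top_p` left at `0`, `candidate_probs` uses only the top-k filter, matching the description of the default behaviour above. A guidance scale of `1` returns the conditioned logits unchanged, and larger values move further from the unconditioned prediction.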
Version Details
- Version ID: 154b3e5141493cb1b8cec976d9aa90f2b691137e39ad906d2421b74c2a8c52b8
- Version Created: August 21, 2023