sakemin/musicongen
About
"MusiConGen: Rhythm and chord control for Transformer-based text-to-music generation"

Example Output
Prompt:
"A laid-back blues shuffle with a relaxed tempo, warm guitar tones, and a comfortable groove, perfect for a slow dance or a night in. Instruments: electric guitar, bass, drums."
Output
Performance Metrics
- Prediction Time: 48.22s
- Total Time: 175.86s
All Input Parameters
{ "bpm": 120, "top_k": 250, "top_p": 0, "prompt": "A laid-back blues shuffle with a relaxed tempo, warm guitar tones, and a comfortable groove, perfect for a slow dance or a night in. Instruments: electric guitar, bass, drums.", "duration": 30, "time_sig": "4/4", "temperature": 1, "text_chords": "C G A:min F", "output_format": "wav", "classifier_free_guidance": 3 }
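The same parameters can be assembled in code. The following is a minimal sketch, assuming the request is submitted with the Replicate Python client (`replicate.run`); `build_input` and `run_prediction` are hypothetical helper names chosen for illustration and are not part of the model itself. The defaults mirror the example values above, and the prompt is shortened.

```python
import json

# Version hash as listed under "Version Details" on this page.
MODEL_VERSION = "a05ec8bdf5cc902cd849077d985029ce9b05e3dfb98a2d74accc9c94fdf15747"


def build_input(prompt, text_chords, bpm=120, time_sig="4/4", duration=30):
    """Assemble an input payload using the parameter names documented on this page."""
    return {
        "prompt": prompt,
        "text_chords": text_chords,
        "bpm": bpm,
        "time_sig": time_sig,
        "duration": duration,
        "top_k": 250,
        "top_p": 0,
        "temperature": 1,
        "classifier_free_guidance": 3,
        "output_format": "wav",
    }


def run_prediction(payload):
    """Submit the payload. Assumes the `replicate` package is installed
    and an API token is configured in the environment."""
    import replicate  # imported lazily so build_input works without the package

    return replicate.run(f"sakemin/musicongen:{MODEL_VERSION}", input=payload)


payload = build_input(
    prompt="A laid-back blues shuffle. Instruments: electric guitar, bass, drums.",
    text_chords="C G A:min F",
)
print(json.dumps(payload, indent=2))
```

Calling `run_prediction(payload)` would send the request; it is left out of the example run because it needs network access and credentials.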
Input Parameters
- bpm
- Tempo, in beats per minute (BPM), for the generated output. Chord and rhythm conditions are generated based on this value, which is also appended to the end of `prompt`.
- seed
- Seed for random number generator. If `None` or `-1`, a random seed will be used.
- top_k
- Reduces sampling to the k most likely tokens.
- top_p
- Reduces sampling to the most likely tokens whose cumulative probability reaches p. When set to `0` (default), top_k sampling is used.
- prompt
- A description of the music you want to generate.
- duration
- Duration of the generated audio in seconds.
- time_sig
- Time signature (meter) for the generated output. Chord and rhythm conditions are generated based on this value, which is also appended to the end of `prompt`.
- temperature
- Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.
- text_chords
- A text-based chord progression condition. A single uppercase letter (e.g. `C`) is treated as a major chord. Chord attributes (`maj`, `min`, `dim`, `aug`, `min6`, `maj6`, `min7`, `minmaj7`, `maj7`, `7`, `dim7`, `hdim7`, `sus2` and `sus4`) can be added after the root letter, separated by `:` (e.g. `A:min7`). Each chord token, separated by a `SPACE`, is allocated to a single bar. To allocate more than one chord to a single bar, join the chords with `,` and no `SPACE` (e.g. `C,C:7 G,E:min A:min`).
- output_format
- Output format for generated audio.
- classifier_free_guidance
- Increases the influence of inputs on the output. Higher values produce lower-variance outputs that adhere more closely to inputs.
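The `text_chords` syntax described above can be checked before submitting a request. The sketch below is an illustration of the documented rules only, not the model's own parser: `parse_text_chords` and `bars_in_clip` are hypothetical helpers. Two assumptions are made: root validation only checks that the root starts with a letter from A to G (the page does not say how accidentals are written), and `bpm` is taken to count the beat unit of `time_sig`, so the numerator gives the beats per bar. How the model behaves when the progression has fewer bars than the clip is not stated here.

```python
# Chord attributes listed in the `text_chords` description.
QUALITIES = {
    "maj", "min", "dim", "aug", "min6", "maj6", "min7",
    "minmaj7", "maj7", "7", "dim7", "hdim7", "sus2", "sus4",
}


def parse_text_chords(text_chords):
    """Split a progression into bars; each bar is a list of (root, quality)."""
    bars = []
    for token in text_chords.split():        # one space-separated token per bar
        bar = []
        for chord in token.split(","):       # commas group chords within one bar
            root, _, quality = chord.partition(":")
            quality = quality or "maj"        # a bare root is a major chord
            # Assumption: only the first character of the root is validated.
            if not root or root[0] not in "ABCDEFG":
                raise ValueError(f"invalid chord root in {token!r}")
            if quality not in QUALITIES:
                raise ValueError(f"unknown chord attribute {quality!r}")
            bar.append((root, quality))
        bars.append(bar)
    return bars


def bars_in_clip(duration, bpm, time_sig):
    """Number of bars that fit in `duration` seconds at the given tempo."""
    beats_per_bar = int(time_sig.split("/")[0])
    return duration * bpm / 60 / beats_per_bar


print(parse_text_chords("C,C:7 G,E:min A:min"))
print(bars_in_clip(30, 120, "4/4"))  # 15.0
```

With the example inputs on this page (30 seconds, 120 BPM, 4/4), the clip spans 15 bars, while `C G A:min F` describes 4 bars. A stray space after a comma, as in `G, E:min`, leaves an empty chord in the token `G,` and raises a `ValueError` in this sketch.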
Output Schema
Output
Example Execution Logs
Using seed 2360678342
Version Details
- Version ID: a05ec8bdf5cc902cd849077d985029ce9b05e3dfb98a2d74accc9c94fdf15747
- Version Created: August 3, 2024