aiviostudio/salmonn-2025 🔢📝🖼️ → 📝

▶️ 45 runs 📅 Apr 2025 ⚙️ Cog 0.9.13
audio-to-text music-understanding

About

Example Output

Prompt:

"describe the music "

Output

This music can be described as a blend of rock, alternative, and indie genres. It features a mix of electric and acoustic guitars, with a focus on creating a mellow and introspective atmosphere. The lyrics explore themes of love, loss, and the passage of time. The vocals are performed by a male vocalist with a smooth and soulful voice. The overall sound is reminiscent of classic rock bands from the 70s and 80s, with a modern twist. This music would be perfect for a chill night in with friends, or as the soundtrack to a coming-of-age movie.

Performance Metrics

32.97s Prediction Time
364.07s Total Time
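The gap between the two figures above can be worked out directly. The sketch below assumes that the difference between total time and prediction time is overhead outside the model's own inference (for example queueing or cold-start setup); the page does not break this down, so treat that interpretation as an assumption. The function name `overhead_seconds` is hypothetical.

```python
def overhead_seconds(total_time: float, prediction_time: float) -> float:
    """Return the time not spent in the prediction itself, in seconds."""
    if prediction_time > total_time:
        raise ValueError("prediction time cannot exceed total time")
    # Round to 2 decimals to match the precision reported on the page.
    return round(total_time - prediction_time, 2)


# Figures reported in the Performance Metrics section.
overhead = overhead_seconds(364.07, 32.97)
share = round(100 * 32.97 / 364.07, 1)  # prediction time as % of total
print(f"overhead: {overhead}s, prediction share: {share}%")
```

For this run, roughly 331 seconds of the total fall outside the prediction step.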
All Input Parameters
{
  "top_p": 0.9,
  "prompt": "describe the music ",
  "num_beams": 4,
  "audio_file": "https://replicate.delivery/pbxt/MmG7bUBBIPLhXrG5aHh2oB8fmjmGfhaHpPygHN8X5L0PQWfk/sad-output.mp3",
  "temperature": 1
}
Input Parameters
top_p Type: number, Default: 0.9
Top p sampling parameter
prompt (required) Type: string
Text prompt to guide the audio understanding
num_beams Type: integer, Default: 4
Number of beams for beam search
audio_file (required) Type: string
Audio file to process
temperature Type: number, Default: 1
Sampling temperature
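The parameter list above can be turned into a small payload builder. This is a minimal sketch, not the model's own validation logic: the helper name `build_input` and the `DEFAULTS` table are hypothetical, and the checks only mirror the types and defaults stated on this page. No value ranges are enforced because the page does not specify any.

```python
from typing import Any, Dict

# Defaults as listed under Input Parameters.
DEFAULTS: Dict[str, Any] = {"top_p": 0.9, "num_beams": 4, "temperature": 1}


def build_input(prompt: str, audio_file: str, **overrides: Any) -> Dict[str, Any]:
    """Build an input payload with the two required fields plus defaults."""
    if not isinstance(prompt, str) or not prompt:
        raise ValueError("prompt is required and must be a non-empty string")
    if not isinstance(audio_file, str) or not audio_file:
        raise ValueError("audio_file is required and must be a non-empty string")

    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    payload = {**DEFAULTS, **overrides, "prompt": prompt, "audio_file": audio_file}

    # num_beams is an integer; top_p and temperature are numbers.
    # bool is excluded explicitly because it is a subclass of int in Python.
    beams = payload["num_beams"]
    if isinstance(beams, bool) or not isinstance(beams, int):
        raise TypeError("num_beams must be an integer")
    for name in ("top_p", "temperature"):
        value = payload[name]
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise TypeError(f"{name} must be a number")
    return payload


# Placeholder URL for illustration only.
example = build_input("describe the music", "https://example.com/clip.mp3")
print(example)
```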
Output Schema

Output

Type: string

Example Execution Logs
Processing prompt: describe the music
Audio file path: /tmp/tmp57gwu5ansad-output.mp3
Version Details
Version ID
e6bd4dcfe3601efbcd9f5b5e81a4828eade396e9d2d6362c79070943862664b5
Version Created
April 4, 2025
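To pin a call to this exact version, the model name and version ID can be joined into a single reference. The sketch below assumes the `replicate` Python client is installed and that a `REPLICATE_API_TOKEN` environment variable is set; the helper names `model_ref` and `describe_audio` are hypothetical and not part of this page. The import is deferred so the reference can be built without the client installed.

```python
MODEL = "aiviostudio/salmonn-2025"
VERSION = "e6bd4dcfe3601efbcd9f5b5e81a4828eade396e9d2d6362c79070943862664b5"


def model_ref(model: str = MODEL, version: str = VERSION) -> str:
    """Return the 'owner/name:version' reference for a pinned version."""
    return f"{model}:{version}"


def describe_audio(audio_url: str, prompt: str = "describe the music") -> str:
    """Run the pinned version on an audio URL and return the text output."""
    # Deferred import: needs `pip install replicate` and REPLICATE_API_TOKEN.
    import replicate

    output = replicate.run(
        model_ref(),
        input={
            "prompt": prompt,
            "audio_file": audio_url,
            "top_p": 0.9,        # defaults from the Input Parameters section
            "num_beams": 4,
            "temperature": 1,
        },
    )
    # The output schema is a string; joining is a fallback in case the
    # client returns an iterator of text chunks instead.
    return output if isinstance(output, str) else "".join(output)


print(model_ref())
```

Calling `describe_audio` with a publicly reachable audio URL should return a text description like the example output shown earlier, though the exact wording will vary between runs because sampling is enabled.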