zsxkib/audio-flamingo-3
Analyze audio and answer questions about speech, music, and sound effects. Accepts an audio file and an optional text pr...
Found 19 models (showing 1-19)
Analyze audio and answer questions about speech, music, and sound effects. Accepts an audio file and an optional text pr...
Generate natural-language descriptions of audio from an input audio file and a guiding text prompt. Perform music analys...
Caption and analyze audio and music from an audio file using a text prompt. Provide free-form text responses to instruct...
Classify music audio into genre and instrument tags from an input audio clip. Takes an audio file and returns a list of...
Classify music approachability and engagement from an audio file or YouTube URL. Return either binary labels (low/high),...
Classify music style and genre from audio. Accepts an audio file or YouTube link and predicts the top-N Discogs styles (...
Transcribe piano audio into MIDI note events. Accepts a piano recording and outputs a MIDI file with polyphonic notes, o...
Analyze music structure from an audio file. Return tempo (BPM), beats, downbeats, segment boundaries, and functional seg...
Predict arousal and valence (music emotion) from an audio input. Accepts an audio file or YouTube URL and returns either...
Classify music by genre, mood, and instrumentation from an audio file or YouTube URL. Analyze audio using transfer-learn...
Analyze music to extract song structure, tempo (BPM), and downbeats, and optionally separate stems. Takes an audio file...
Transcribe music audio into MIDI. Accepts an audio file and outputs a MIDI transcription capturing pitches, onsets, and...
Classify music into up to 400 styles from an audio file or YouTube link, returning the top-N predictions as a bar-chart...
Estimate tempo (BPM) from an audio file or YouTube URL. Run Essentiaβs tempo algorithmsβdegara, multifeature, percival,...
Analyze music structure and separate stems from a music audio input. Uses Harmonix models for structure analysis and Dem...
Auto-tune vocals from an input audio file, returning a pitch-corrected audio track and an optional pitch-correction visu...
Transcribe musical audio into MIDI. Accepts an audio file and outputs a MIDI file with note events, capturing pitch, ons...
Transcribe music from an audio input to MIDI. Choose modes for piano transcription (music-piano, music-piano-v2), multi-...
Transcribe saxophone solos from audio or YouTube links into MIDI and MusicXML sheet music. Accept an audio file or YouTu...