zsxkib/kimi-audio-7b-instruct
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Found 7 models (showing 1-7)
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Classify music audio into genre and instrument tags from an input audio clip. Takes an audio file and returns a list of...
Classify music approachability and engagement from an audio file or YouTube URL. Return either binary labels (low/high),...
Classify music style and genre from audio. Accepts an audio file or YouTube link and predicts the top-N Discogs styles (...
Classify music by genre, mood, and instrumentation from an audio file or YouTube URL. Analyze audio using transfer-learn...
Classify music into up to 400 styles from an audio file or YouTube link, returning the top-N predictions as a bar-chart...
Classify speaker gender from an audio file. Accepts a speech audio clip and returns probabilities for male and female cl...