
zsxkib/kimi-audio-7b-instruct
Transcribe audio and generate spoken or textual responses from an audio input. Accepts an audio clip and optional text p...
Found 7 models (showing 1-7)
Transcribe audio and generate spoken or textual responses from an audio input. Accepts an audio clip and optional text p...
Tag music audio with genre and instrument labels. Accepts an audio clip and returns a list of predicted tags such as gen...
Classify music approachability and engagement from an audio file or YouTube URL. Return two-class (low/high), three-clas...
Classify music styles from audio. Accepts a YouTube URL or audio file and predicts the top N styles among 400 Discogs st...
Classify music by genre, mood, and instrumentation from an audio file or YouTube URL. Return predicted tags and confiden...
Classify music styles from audio or a YouTube URL and return top-N genre predictions as JSON or a bar-chart visualizatio...
Classify speaker gender from an audio file. Accepts a speech audio clip and returns probabilities for male and female cl...