
zsxkib/kimi-audio-7b-instruct
Transcribe audio and generate spoken or textual responses from an audio input. Accepts an audio clip and optional text p...
Found 76 models (showing 1-20)
Separate described sounds from a mixed audio clip using a natural-language text query, outputting an isolated audio trac...
Separate a target sound from a mixture using a query audio example. Provide a mixture audio file and a short reference c...
Upsample audio to 48 kHz from lower-resolution inputs. Takes an audio file and returns a super-resolved 48 kHz audio out...
Enhance and denoise speech audio, outputting a cleaned, improved audio file. Accepts an input audio clip and optionally...
Generate music from a text prompt or continue/mimic an input audio clip. Accepts a natural-language description and opti...
Generate music from a text prompt or a reference audio melody. Accepts text and optionally an input audio clip to either...
Generate chamber choir music from a text prompt. Optionally condition on an input audio clip to continue a section or mi...
Generate stereo music from a text prompt conditioned by chord progressions. Accept text-based chords with bar-level cont...
Generate music from a text prompt constrained by chord progressions, BPM, and time signature. Accept chord conditions as...
Generate music from a text prompt. Optionally condition on an input audio clip to continue it between specified timestam...
Remix songs from an input audio track using a text prompt to generate a new arrangement that follows the original chords...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or continue it from a...
Generate music from a text prompt, with optional audio conditioning (melody following) or continuation of an input clip....
Generate music from a text prompt, an input audio reference, or by continuing an audio clip. Accepts a prompt describing...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or continue it, with...
Generate music from a text prompt, with optional audio conditioning for melody mimicry or seamless continuation. Accepts...
Generate short stereo music from a text prompt or by continuing/mimicking an input audio clip. Specify genre, mood, inst...
Generate stereo music from a text prompt. Optionally condition on an input audio clip to mimic its melody or continue a...
Generate Ancient Greek lyre instrumentals from a text prompt. Optionally condition on an input audio clip to continue it...