zsxkib/kimi-audio-7b-instruct
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Found 94 models (showing 1-20)
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Separate a target sound from a mixed audio recording using a natural-language text query. Provide an audio file and a de...
Separate a target sound source from a mixed audio track using an example query audio. Takes a mixture audio and a short...
Upsample and restore audio to 48 kHz from lower-quality inputs. Takes an audio file and returns a higher-fidelity audio...
Denoise and enhance speech in audio recordings. Takes an audio file as input and outputs a cleaned, enhanced audio file...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or continue the sourc...
Generate music from a text prompt or an input melody. Accepts text and/or a reference audio clip and outputs stereo musi...
Generate choral and choir music from a text prompt, with optional audio conditioning to mimic a melody or continue an in...
Generate stereo music from a text prompt with explicit chord control or from chords extracted from an input audio clip....
Generate music from a text prompt constrained by chord progressions, BPM, and time signature. Provide chord control as t...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or seamlessly continu...
Remix music into new styles from a text prompt and an input audio track. Generate a new backing track conditioned on the...
Generate music from a text prompt. Optionally condition on an input audio clip to continue it or mimic its melody for mu...
Generate music from a text prompt, with optional audio conditioning (melody following) or continuation of an input clip....
Generate music from a text prompt, an input audio reference, or by continuing an audio clip. Accepts a prompt describing...
Generate music from a text prompt. Optionally condition on an input audio clip to continue the track or mimic its melody...
Generate music from a text prompt, with optional audio conditioning for melody mimicry or seamless continuation. Accepts...
Generate stereo music from a text prompt or an input audio reference. Provide a descriptive prompt (genre, instruments,...
Generate music from a text prompt with a Oneohtrix Point Never (OPN) fine-tuned style. Accept a text description and opt...
Generate Ancient Greek lyre instrumentals from a text prompt. Optionally condition on an input audio clip to mimic its m...