zsxkib/kimi-audio-7b-instruct
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Found 98 models (showing 1-20)
Transcribe speech and generate spoken replies from an audio input. Accepts an audio file (with an optional text prompt)...
Separate a target sound from a mixed audio recording using a natural-language text query. Provide an audio file and a de...
Separate a target sound source from a mixed audio track using an example query audio. Takes a mixture audio and a short...
Upsample and restore audio to 48 kHz from lower-quality inputs. Takes an audio file and returns a higher-fidelity audio...
Denoise and enhance speech in audio recordings. Takes an audio file as input and outputs a cleaned, enhanced audio file...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or continue the sourc...
Generate music from a text prompt or an input melody. Accepts text and/or a reference audio clip and outputs stereo musi...
Generate choral and choir music from a text prompt, with optional audio conditioning to mimic a melody or continue an in...
Generate music in stereo based on text prompts and chord progressions. Accepts either text-based chord sequences using s...
Generate music from text prompts with chord progression control using either text-based chord sequences or audio-based c...
Generate music from a text prompt. Optionally condition on an input audio clip to mimic its melody or seamlessly continu...
Remix music into new styles from a text prompt and an input audio track. Generate a new backing track conditioned on the...
Generate music from a text prompt. Optionally condition on an input audio clip to continue it or mimic its melody for mu...
Generate music from a text prompt, with optional audio conditioning (melody following) or continuation of an input clip....
Generate music from a text prompt, an input audio reference, or by continuing an audio clip. Accepts a prompt describing...
Generate music from a text prompt. Optionally condition on an input audio clip to continue the track or mimic its melody...
Generate music from a text prompt, with optional audio conditioning for melody mimicry or seamless continuation. Accepts...
Generate stereo music from a text prompt or an input audio reference. Provide a descriptive prompt (genre, instruments,...
Generate music from text prompts with fine-tuning specifically on tracks by Oneohtrix Point Never using the "OPN" text t...
Generates Ancient Greek lyre instrumentals from text prompts with structured musical form. Built on Meta's MusicGen arch...