
stability-ai/stable-audio-2.5
Generate music and sound effects from a text prompt. Accept a text description (e.g., genre, BPM, instrumentation, mood)...
Found 19 models (showing 1-19)
Generate music and sound effects from a text prompt. Accept a text description (e.g., genre, BPM, instrumentation, mood)...
Generate audio from a text prompt. Provide a description and target duration to synthesize music, ambient soundscapes, a...
Add synchronized environmental sounds and sound effects to videos from a video input. Accepts a video and an optional te...
Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for duration (2.5–20s),...
Generate sound effects and short music from a text prompt. Accepts a prompt (with optional negative prompt) and returns...
Generate synced sound effects for a silent video and return the video with a new soundtrack. Takes a video up to 10 seco...
Generate synchronized audio from a video input. Takes a video and an optional text prompt, and produces sound effects or...
Add synchronized sound and ambient effects to videos. Generate context-aware audio from a video input guided by a text p...
Generate sound effects and ambient audio from text prompts. Accept a natural-language description plus options for durat...
Generate music and sound effects from a text prompt. Accepts a prompt and returns short audio clips (10s or 30s), with s...
Generate synchronized Foley sound effects from video with optional text guidance. Takes a video input and an optional pr...
Generate short audio clips and sound effects from a text prompt. Create drum beats, instrument riffs, ambient textures,...
Generate sound effects, ambiences, and general non-speech audio from a text prompt. Provide a description and target dur...
Generate sound effects and general audio from a text prompt. Produce human vocalizations and speech, animal sounds, natu...
Generate music and audio textures from noise using an unconditional audio diffusion model. Configure clip length (second...
Generate algorithmic 8-bit synth music from one-line C expressions. Provide a duration and optionally a sample expressio...
Generate speech and other audio from a text prompt. Produce multilingual speech with automatic language detection and co...
Generate multilingual speech and other audio from a text prompt. Produce spoken dialogue, background noise, simple sound...
Generate contextual audio and foley from a video input. Accept an input video plus optional caption and a chain-of-thoug...