← Back to search

Sound Effect Generation AI Models

Sound effect generation models create short audio clips from prompts or conditioning inputs. They are useful for game prototypes, video production, UI sounds, ambience, Foley concepts, and rapid sound design exploration.

Compare models by clip length, prompt control, sample rate, noise level, and whether the output loops or layers cleanly. For games and video, timing and editability often matter as much as raw audio quality.

Sort by:

Found 21 models (showing 1-20)

🤖 Model 🔊

stability-ai/stable-audio-2.5

Generate music and sound effects from a text prompt. Optionally extend or inpaint an input audio clip for seamless conti...

🔊 • music-generation • sound-effect-generation • audio-to-audio • 4.9K runs

🤖 Model 📝 → 🔊

smaerdlatigid/stable-audio

Generate audio clips from a text prompt. Input a descriptive prompt and a target duration to synthesize music, ambience,...

📝 → 🔊 • sound-effect-generation • music-generation • text-to-audio • 43 runs

🤖 Model 🎥

zsxkib/mmaudio

Generate synchronized environmental sounds and sound effects from a video input. Produce context-aware audio aligned to...

🎥 • video-to-audio • sound-effect-generation • 3.8M runs

🤖 Model 📝 → 🔊

haoheliu/audio-ldm

Generate audio from a text prompt. Produce sound effects, human speech, and music, with controls for clip duration and t...

📝 → 🔊 • text-to-audio • sound-effect-generation • music-generation • 38.2K runs

🤖 Model 📝 → 🔊

gougouccnu/stable-audio-open-1.0

Generate audio from a text prompt, including sound effects, ambience, and musical textures. Accepts a prompt (and option...

📝 → 🔊 • sound-effect-generation • music-generation • text-to-audio • 583 runs

🤖 Model 🎥

mirelo/video-to-sfx

Generate synced sound effects for a silent video and return the video with a new soundtrack. Take a video input (max 10s...

🎥 • video-to-audio • sound-effect-generation • 2.2K runs

🤖 Model 🎥

acappemin/video-to-audio-and-piano

Generate synchronized audio for a video from a video input and optional text prompt, with an option to produce piano acc...

🎥 • video-to-audio • music-generation • sound-effect-generation • 138 runs

🤖 Model 🎥

zsxkib/mmaudio-t4

Add synchronized environmental sound and foley to videos based on visual content. Accepts a video and an optional text p...

🎥 • video-to-audio • image-to-audio • 1.6K runs

🤖 Model

sepal/audiogen

Generate sound effects from a text prompt. Takes a text description and optional duration (1–10 seconds) and outputs an...

sound-effect-generation • 79.9K runs

🤖 Model 📝 → 🔊

lucataco/magnet

Generates music from text prompts using a non-autoregressive transformer architecture. Offers multiple model sizes (300M...

📝 → 🔊 • music-generation • text-to-audio • sound-effect-generation • 3.1K runs

🤖 Model 🎥

tencent/hunyuanvideo-foley

Generate synchronized Foley sound effects from a video input and optional text prompt. Condition on visual content and s...

🎥 • video-to-audio • sound-effect-generation • foley • 6.7K runs

🤖 Model

stackadoc/stable-audio-open-1.0

Generate short audio samples and sound effects from text prompts. Provide a prompt with optional controls for duration (...

music-generation • sound-effect-generation • 22.5K runs

🤖 Model 📝 → 🔊

declare-lab/tangoflux

Generate audio from a text prompt. Produce 44.1 kHz sound effects and ambiences from natural language with user-set dura...

📝 → 🔊 • text-to-audio • sound-effect-generation • 54.4K runs

🤖 Model 📝 → 🔊

declare-lab/tango

Generate sound effects and general non-music audio from a text prompt. Accepts a textual description and returns an audi...

📝 → 🔊 • text-to-audio • sound-effect-generation • 132.2K runs

🤖 Model

harmonai/dance-diffusion

Generate music and audio textures from noise using an unconditional audio diffusion model. Configure clip length (second...

music-generation • sound-effect-generation • 5.2K runs

🤖 Model

andreasjansson/synth-one-liner

Generates algorithmic 8-bit synthesizer music from automatically created mathematical expressions. Uses grammar-constrai...

music-generation • sound-effect-generation • 214 runs

🤖 Model 📝 → 🔊

pollinations/bark

Generates speech and audio from text prompts using a transformer-based model. Supports multilingual speech generation wi...

📝 → 🔊 • text-to-speech • music-generation • sound-effect-generation • 1.2K runs

🤖 Model 📝 → 🔊

suno-ai/bark

Generate speech, music, background noise, and simple sound effects from a text prompt. Output an audio file, with an opt...

📝 → 🔊 • text-to-speech • music-generation • sound-effect-generation • 302.0K runs

🤖 Model 🎥

zsxkib/thinksound

Generate contextual audio for videos from a video input with optional text guidance, returning a video with a synchroniz...

🎥 • video-to-audio • sound-effect-generation • 6.3K runs

🤖 Model 🎥

mirelo/video-to-sfx-v1.5

Generate synchronized sound effects for silent videos and return the video with a new soundtrack. Takes a video input (u...

🎥 • video-to-audio • sound-effect-generation • 1.1K runs

Before using generated sound effects in production, check licensing, loudness, clipping, looping behavior, and how the sound sits in a full mix. Generate multiple variations so editors or designers can choose the version that fits the scene.