🤖 Model 🎥 zsxkib/thinksound Generate contextual audio and foley from a video input. Accept an input video plus optional caption and a chain-of-thoug... 🎥 • video-to-audio • sound-effect-generation • 6.1K runs