mirelo/video-to-sfx-v1 🔢🖼️📝 → 🖼️

⭐ Official ▶️ 2.8K runs 📅 Sep 2025 ⚙️ Cog 0.16.9
foley sound-effect-generation video-editing video-to-audio

About

Generate synced sounds for any video, and return it with its new sound track

Example Output

Output

Performance Metrics

24.65s Prediction Time
33.44s Total Time
All Input Parameters
{
  "video_path": "https://replicate.delivery/pbxt/NjAV16ybJgqVddjbI73DMEqSyOI9G7ONpSaTqbENgg0aHKrL/video_1.mp4",
  "num_samples": 4,
  "text_prompt": ""
}
Input Parameters
seed Type: integer
Random seed for reproducibility. Leave blank (None) or use -1 for random seed, or any integer for deterministic results.
steps Type: integerDefault: 25Range: 1 - 30
Number of processing steps for the generation model. Higher values may improve quality but take longer.
duration Type: integerDefault: 10Range: 1 - 10
Duration of the generated sound effects in seconds.
video_path (required) Type: string
Video file to process for sound effects. Video will be trimmed to 10 sec if longer
num_samples Type: integerDefault: 2Range: 1 - 4
Number of sound effects to generate. Each sample will be a different variation.
text_prompt Type: stringDefault:
Text prompt to guide sound effect generation. Optional text to guide the sound generation process.
start_offset Type: numberDefault: 0Range: 0 - 300
Starting point in the video (in seconds) from which to generate audio. 0 means start from the beginning.
creativity_coef Type: numberDefault: 4.5Range: 1 - 10
Creativity coefficient to control the creativity of the generated sound. Higher values are more creative.
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
INFO:utils:Using seed: 957184 for generation.
Version Details
Version ID
34ea7bf892b6e989be29a9ee613d36a31d62fcbccc79b8c3f27852e0f01f2f87
Version Created
November 14, 2025
Run on Replicate →