wan-video/wan-2.5-i2v 🔢🖼️📝❓✓ → 🖼️

⭐ Official ▶️ 24.9K runs 📅 Sep 2025 ⚙️ Cog 0.16.7
image-to-video-with-audio lipsync multilingual

About

Alibaba Wan 2.5 Image to video generation with background audio

Example Output

Prompt:

"A figure skater performing in a surreal underground cavern with bioluminescent water"

Output

Performance Metrics

555.26s Prediction Time
555.28s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/NliewIc4bFNWJIfiDSRMzvozuAl3dLwIx1fldasrYoXpME3g/image.png",
  "prompt": "A figure skater performing in a surreal underground cavern with bioluminescent water",
  "duration": 5,
  "resolution": "720p",
  "negative_prompt": "",
  "enable_prompt_expansion": true
}
Input Parameters
seed Type: integer
Random seed for reproducible generation
audio Type: string
Audio file (wav/mp3, 3-30s, ≤15MB) for voice/music synchronization
image (required) Type: string
Input image for video generation
prompt (required) Type: string
Text prompt for video generation
duration Default: 5
Duration of the generated video in seconds
resolution Default: 720p
Video resolution
negative_prompt Type: stringDefault:
Negative prompt to avoid certain elements
enable_prompt_expansion Type: booleanDefault: true
If set to true, the prompt optimizer will be enabled
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Using seed: 2062703732
Generating...
Generated content in 551.2sec
Downloading 5998282 bytes
Downloaded 5.72MB in 3.03sec
Version Details
Version ID
fd3b3dc94a49e3af4fa371ba943722886c5fc93a3694bf7c442e070a429ef05f
Version Created
September 24, 2025
Run on Replicate →