wan-video/wan-2.5-t2v
Generate videos with audio from a text prompt. Produce 5–10 second clips at 480p, 720p, or 1080p in six aspect ratios (8...
Found 18 models (showing 1-18)
Generate audio-driven videos from a text prompt, a reference image, and an audio clip. Synchronize lip movements to spee...
Generate a waveform video from an audio file. Convert music, podcasts, or voice tracks into a bar-style audio visualizer...
Generate short videos from a text prompt, optimized for speed. Optionally condition motion to an input audio track for b...
Generate talking avatars by combining an input image of a person and an audio file for lip synchronization. The model in...
Generate lip-synced avatar videos from a single reference image and an input audio track. Produce identity-preserving ta...
Animate a portrait image into a lip-synced talking-head video driven by an input audio clip. Provide a face image and sp...
Generate co-speech gesture animations from audio input using expressive masked audio gesture modeling. This model output...
Generate lip-synced talking avatar videos from an input image and audio file, suitable for UGC, TikTok, and Reels conten...
Synchronize lip movements in a face image or video to match input audio, generating a talking-head video. Accepts a face...
Generate lip-synced talking-head videos from a reference image and one or two audio tracks. Animate one or two people in...
Generate lip-synced talking-head video from speech audio. Animate a preset face identity (F1–F8, M1–M6) with an optional...
Animate a single image into a talking or performing video synchronized to your audio. Takes an input image of a human su...
Generate photorealistic talking-head video from speech audio. Provide an audio clip and select a preset portrait (May, O...
Create a video from a still image and an audio track. Provide one image and one audio file; the model loops the image fo...
Generate talking-head videos by animating a single image to match an input audio track. Takes a portrait image and speec...
Generate videos from text prompts with optional audio-driven lip-sync and motion guidance. Accepts a text prompt (primar...
Animate a single image into a lip-synced talking-head video from an audio clip. Provide a source portrait and speech aud...