wan-video/wan-2.5-t2v
Generate videos with audio from a text prompt. Produce 5–10 second clips at 480p, 720p, or 1080p across six aspect ratio...
Found 24 models (showing 1-20)
Generate videos with audio from a text prompt. Produce 5–10 second clips at 480p, 720p, or 1080p across six aspect ratio...
Generate lip-synced character video from an audio clip and a reference image. Animate a still portrait into speech or si...
Generate a waveform video from an audio file. Convert music, podcasts, or voice tracks into a bar-style audio visualizer...
Generate short videos from a text prompt, optimized for fast turnaround. Optionally condition on an input audio clip to...
Generate talking avatars by combining an input image of a person and an audio file for lip synchronization. The model in...
Generate audio-driven talking avatar videos from a single reference image and an input audio track. Synchronize lip move...
Generate a talking-head video with audio from a single portrait image and an audio track. Animate the face with audio-dr...
Generate co-speech gesture animations from audio input using expressive masked audio gesture modeling. This model output...
Generate lip-synced talking avatar videos from an input image and audio file, suitable for UGC, TikTok, and Reels conten...
Generate lip-synced talking-head video from a face image or face video and an audio track. Align mouth movements to spee...
Generate conversational talking-head videos from a reference image and one or two audio tracks. Provide an image with on...
Generate lip-synced talking-head video from speech audio. Animate a preset face identity (F1–F8, M1–M6) with an optional...
Generate talking or singing videos from a single image and an audio clip. Provide a portrait, half-body, or full-body im...
Generate lip-synced talking-head videos from an input audio clip. Provide a driving audio file (only the first 20 second...
Create a video from a still image and an audio track. Provide one image and one audio file; the model loops the image fo...
Generate talking head videos from a single image and an audio clip. Animate lip movements and emotion-aligned facial exp...
Generate short videos from a text prompt, optionally syncing lip movements and body motion to an input audio track. Acce...
Animate a single image into a lip-synced talking-head video from an audio clip. Provide a source portrait and speech aud...
Generate audio-reactive video frames from a text prompt and an audio file. Use a Stable Diffusion pipeline to synthesize...
Generate audio-reactive video from text prompts and an input audio file. Provide one or multiple newline-separated promp...