wan-video/wan-2.5-t2v
Generate videos with synchronized audio from text prompts using Alibaba's WAN 2.5 model. Creates fully synchronized vide...
Found 28 models (showing 1-20)
Generate videos with synchronized audio from text prompts using Alibaba's WAN 2.5 model. Creates fully synchronized vide...
Generates cinematic videos synchronized to audio from a reference image, text prompt, and audio file. Built on the Wan2....
Generate a waveform video from an audio file. Convert music, podcasts, or voice tracks into a bar-style audio visualizer...
Generates videos with synchronized audio from text prompts, optimized for faster generation times compared to the standa...
Generate talking avatars by combining an input image of a person and an audio file for lip synchronization. The model in...
Generate audio-driven talking avatar videos from a single reference image and an input audio track. Synchronize lip move...
Generate a talking-head video with audio from a single portrait image and an audio track. Animate the face with audio-dr...
Generate co-speech gesture animations from audio input using expressive masked audio gesture modeling. This model output...
Generate lip-synced talking avatar videos from an input image and audio file, suitable for UGC, TikTok, and Reels conten...
Generate lip-synced talking-head video from a face image or face video and an audio track. Align mouth movements to spee...
Generate conversational talkingβhead videos from a reference image and one or two audio tracks. Provide an image with on...
Generate lip-synced talking-head video from speech audio. Animate a preset face identity (F1βF8, M1βM6) with an optional...
Generate talking or singing videos from a single image and an audio clip. Provide a portrait, half-body, or full-body im...
Generate lip-synced talking-head videos from an input audio clip. Provide a driving audio file (only the first 20 second...
Create a video from a still image and an audio file. Takes one image and an audio track and outputs a video with the ima...
Generate talking head videos from a single image and an audio clip. Animate lip movements and emotion-aligned facial exp...
Generate short videos from a text prompt, optionally syncing lip movements and body motion to an input audio track. Acce...
Animate a single image into a lip-synced talking-head video from an audio clip. Provide a source portrait and speech aud...
Generate audio-reactive video frames from a text prompt and an audio file. Use a Stable Diffusion pipeline to synthesize...
Generate audioβreactive video from text prompts and an input audio file. Provide one or multiple newlineβseparated promp...