vidu/q3-turbo
About
Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.
Example Output
Prompt:
"The tiger swims slowly and effortlessly, 4K photorealism."
Output
Performance Metrics
Prediction Time: 124.22s
Total Time: 124.23s
All Input Parameters
{
  "audio": true,
  "prompt": "The tiger swims slowly and effortlessly, 4K photorealism.",
  "duration": 5,
  "resolution": "1080p",
  "start_image": "https://replicate.delivery/pbxt/OhS5gjYiMXilQ5ZbvowWeREBzNRyWsefClg7fKG3nDATbYiU/Screenshot%202026-02-25%20at%2011.58.58%E2%80%AFAM.png",
  "aspect_ratio": "16:9"
}
Input Parameters
- seed
- Random seed. Set for reproducible generation.
- audio
- Whether to generate audio synchronized with the video (dialogue and sound effects).
- prompt (required)
- Text prompt for video generation. Maximum 5000 characters.
- duration
- Duration of the video in seconds.
- end_image
- End frame image for the video. Must be used together with start_image for start-end-to-video mode. The start and end images must have similar aspect ratios: the ratio between the two aspect ratios must fall between 0.8 and 1.25. Supported formats: png, jpeg, jpg, webp.
- resolution
- Resolution of the output video.
- start_image
- Start frame image for the video. When provided without an end_image, the model runs in image-to-video mode. Supported formats: png, jpeg, jpg, webp.
- aspect_ratio
- Aspect ratio of the output video. Only used in text-to-video mode (ignored when images are provided).
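The mode rules described in the parameter list can be sketched in Python as below. This is a minimal illustration, not the model's own validation logic: the helper names (`select_mode`, `aspect_ratios_similar`, `build_input`) are hypothetical, the 0.8 to 1.25 rule is read here as the start image's aspect ratio divided by the end image's, and dropping `aspect_ratio` on the client side is optional, since the description says it is ignored when images are provided.

```python
from typing import Optional, Tuple

MAX_PROMPT_CHARS = 5000  # limit stated in the prompt description


def select_mode(start_image: Optional[str], end_image: Optional[str]) -> str:
    """Return the generation mode implied by which images are supplied."""
    if end_image and not start_image:
        raise ValueError("end_image must be used together with start_image")
    if start_image and end_image:
        return "start-end-to-video"
    if start_image:
        return "image-to-video"
    return "text-to-video"


def aspect_ratios_similar(start_size: Tuple[int, int],
                          end_size: Tuple[int, int]) -> bool:
    """Check the 0.8-1.25 rule on (width, height) pairs.

    Assumption: the rule compares start aspect ratio / end aspect ratio.
    """
    start_ratio = start_size[0] / start_size[1]
    end_ratio = end_size[0] / end_size[1]
    return 0.8 <= start_ratio / end_ratio <= 1.25


def build_input(prompt: str,
                start_image: Optional[str] = None,
                end_image: Optional[str] = None,
                aspect_ratio: Optional[str] = None,
                **extra) -> dict:
    """Assemble an input dict using the parameter names listed above.

    Extra keyword arguments (seed, audio, duration, resolution) are
    passed through unchanged.
    """
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")

    mode = select_mode(start_image, end_image)
    payload = {"prompt": prompt, **extra}
    if start_image:
        payload["start_image"] = start_image
    if end_image:
        payload["end_image"] = end_image
    # aspect_ratio only applies to text-to-video, so it is omitted otherwise.
    if aspect_ratio and mode == "text-to-video":
        payload["aspect_ratio"] = aspect_ratio
    return payload
```

With the Replicate Python client installed and an API token configured, the resulting dict could be passed as `replicate.run("vidu/q3-turbo", input=build_input(...))`. The image dimensions for `aspect_ratios_similar` would need to be read separately, for example with an image library.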
Output Schema
Output
Example Execution Logs
Task created: 927359935694512128
Video generated in 118.8sec
Downloading 14179442 bytes
Downloaded 13.52MB in 2.66sec
Output video duration: 5.042s
Version Details
- Version ID
- b733a4429921885ae2572ab171e6e11a237c83cc6053e989111b42a315c6dbb3
- Version Created
- April 12, 2026