kwaivgi/kling-v2.6

⭐ Official ▶️ 13.3K runs 📅 Dec 2025 ⚙️ Cog 0.16.9
image-to-video-with-audio text-to-video-with-audio

About

Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation

Example Output

Prompt:

"A cinematic, low-angle tracking shot follows a cyclist from behind as they weave through busy New York City traffic. The camera then smoothly orbits around to the front, capturing the cyclist's determined expression. The cyclist is a young man with a mustache, wearing a white t-shirt, black shorts, white socks, and a blue baseball cap. He is riding a sleek black fixed-gear bicycle. The city streets are filled with iconic yellow taxis and various modern cars, with towering skyscrapers lining the background. Bright, natural sunlight creates sharp contrasts and highlights, enhancing the fast-paced, urban atmosphere. The motion is fluid and dynamic, emphasizing the speed and agility of the cyclist amidst the metropolitan chaos."

Output

Performance Metrics

130.93s Prediction Time
130.94s Total Time
All Input Parameters
{
  "prompt": "A cinematic, low-angle tracking shot follows a cyclist from behind as they weave through busy New York City traffic. The camera then smoothly orbits around to the front, capturing the cyclist's determined expression. The cyclist is a young man with a mustache, wearing a white t-shirt, black shorts, white socks, and a blue baseball cap. He is riding a sleek black fixed-gear bicycle. The city streets are filled with iconic yellow taxis and various modern cars, with towering skyscrapers lining the background. Bright, natural sunlight creates sharp contrasts and highlights, enhancing the fast-paced, urban atmosphere. The motion is fluid and dynamic, emphasizing the speed and agility of the cyclist amidst the metropolitan chaos.",
  "duration": 5,
  "aspect_ratio": "16:9",
  "generate_audio": true,
  "negative_prompt": ""
}
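The parameters above can be sent as the `input` object of a prediction request. The following is a minimal sketch, not an official client: the endpoint URL, the `Bearer` authorization header, and the `{"input": ...}` wrapper are assumptions about Replicate's HTTP API and are not stated on this page. The function name `build_prediction_request` and the shortened prompt are hypothetical. The request is only constructed, never sent.

```python
import json
import urllib.request

# Assumed endpoint for running a model by owner/name; not stated on this page.
API_URL = "https://api.replicate.com/v1/models/kwaivgi/kling-v2.6/predictions"


def build_prediction_request(token: str, model_input: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request that would start a prediction."""
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


# Same parameter values as the example above, with a short stand-in prompt.
example_input = {
    "prompt": "A cyclist weaves through busy city traffic.",
    "duration": 5,
    "aspect_ratio": "16:9",
    "generate_audio": True,
    "negative_prompt": "",
}

request = build_prediction_request("YOUR_API_TOKEN", example_input)
```

Passing `request` to `urllib.request.urlopen` would submit it. Given the roughly 130 s prediction time reported above, a caller would likely need to poll for the result rather than wait on a single response.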
Input Parameters
prompt (required) Type: string
Text prompt for video generation
duration Default: 5
Duration of the video in seconds
start_image Type: string
First frame of the video
aspect_ratio Default: 16:9
Aspect ratio of the video. Ignored if start_image is provided.
generate_audio Type: boolean Default: true
Generate audio for the video. When enabled, the model will create synchronized audio based on the video content.
negative_prompt Type: string Default: (empty string)
Things you do not want to see in the video
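The defaults and the `aspect_ratio` rule listed above can be captured in a small helper. This is a sketch under stated assumptions: `build_input` is a hypothetical name, and the page does not list the accepted values for `duration` or `aspect_ratio`, so none are validated here.

```python
from typing import Optional


def build_input(
    prompt: str,
    duration: int = 5,
    start_image: Optional[str] = None,
    aspect_ratio: str = "16:9",
    generate_audio: bool = True,
    negative_prompt: str = "",
) -> dict:
    """Assemble an input dict using the defaults listed on this page."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    payload = {
        "prompt": prompt,
        "duration": duration,
        "generate_audio": generate_audio,
        "negative_prompt": negative_prompt,
    }
    if start_image is not None:
        # aspect_ratio is ignored when start_image is provided, so leave it out.
        payload["start_image"] = start_image
    else:
        payload["aspect_ratio"] = aspect_ratio
    return payload
```

Omitting `aspect_ratio` whenever `start_image` is set makes the documented precedence visible in the payload itself.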
Output Schema

Output

Type: string Format: uri
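Because the output is a single URI string, a caller may want to check it before downloading. The sketch below assumes the URI uses `http` or `https` and that the file is a video. Neither is guaranteed by the schema, and `output_filename` and the `output.mp4` fallback are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def output_filename(output: str, fallback: str = "output.mp4") -> str:
    """Check that the output is an absolute http(s) URI and derive a local filename."""
    parsed = urlparse(output)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URI: {output!r}")
    # Use the last path segment as the filename, or the fallback if the path is empty.
    name = PurePosixPath(parsed.path).name
    return name or fallback
```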

Example Execution Logs
Using model: kling-v2-6 in pro mode
Generating video...
Generated video in 122.0sec
Generated video ID: 835475801909895203
Downloading 13749309 bytes
Downloaded 13.11MB in 7.28sec
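The log lines above account for most of the reported prediction time: 122.0 s of generation plus 7.28 s of download is 129.28 s, leaving about 1.65 s of the 130.93 s unexplained by these two steps. The logged size also shows that "MB" here means 1024 × 1024 bytes, since 13749309 / 1048576 ≈ 13.11. A minimal sketch of that arithmetic follows. The regular expressions assume the log format stays exactly as shown, and `summarize` is a hypothetical helper.

```python
import re

LOG = """\
Using model: kling-v2-6 in pro mode
Generating video...
Generated video in 122.0sec
Generated video ID: 835475801909895203
Downloading 13749309 bytes
Downloaded 13.11MB in 7.28sec
"""


def summarize(log: str) -> dict:
    """Extract generation time, download time, and file size from the log text."""
    gen = re.search(r"Generated video in ([\d.]+)sec", log)
    size = re.search(r"Downloading (\d+) bytes", log)
    dl = re.search(r"Downloaded [\d.]+MB in ([\d.]+)sec", log)
    if not (gen and size and dl):
        raise ValueError("log does not match the expected format")
    generate_s = float(gen.group(1))
    download_s = float(dl.group(1))
    size_mib = int(size.group(1)) / 2**20  # bytes to units of 1024 * 1024 bytes
    return {
        "generate_s": generate_s,
        "download_s": download_s,
        "total_s": generate_s + download_s,
        "size_mib": size_mib,
        "download_mib_per_s": size_mib / download_s,
    }


summary = summarize(LOG)
```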
Version Details
Version ID
b13f36d030496dd78d2986ba8b2b22a44222b3f58c15fb63ef7d6b4aa3a53319
Version Created
December 31, 2025