kwaivgi/kling-lip-sync 📝🎥❓🖼️🔢 → 🖼️

⭐ Official ▶️ 16.9K runs 📅 May 2025 ⚙️ Cog 0.16.7
lipsync video-editing

About

Add lip-sync to any video with an audio file or text

Example Output

Output

Performance Metrics

635.85s Prediction Time
635.87s Total Time
All Input Parameters
{
  "voice_id": "en_AOT",
  "video_url": "https://replicate.delivery/xezq/ipjGAn65es3cfkPhe7g3IvYQqDfbHeCfBof1ujrrMvb3rQAXKA/tmptucl_ok3.mp4",
  "audio_file": "https://replicate.delivery/pbxt/N245edsFrGTRuk6v5OWFet0nsqiahHTlSF8yRfZEbbZCxSzY/replicate-prediction-sz4ehr9vanrme0cpwnp9wr4g8c.mp3",
  "voice_speed": 1
}
Input Parameters
text Type: string
Text content for lip sync (if not using audio)
video_id Type: string
ID of a video generated by Kling. Cannot be used with video_url.
voice_id Default: en_AOT
Voice ID for speech synthesis (if using text and not audio)
video_url Type: string
URL of a video for lip syncing. It can be an .mp4 or .mov file, should be less than 100MB, with a duration of 2-10 seconds, and a resolution of 720p-1080p (720-1920px dimensions). Cannot be used with video_id.
audio_file Type: string
Audio file for lip sync. Must be .mp3, .wav, .m4a, or .aac and less than 5MB.
voice_speed Type: numberDefault: 1Range: 0.8 - 2
Speech rate (only used if using text and not audio)
Output Schema

Output

Type: stringFormat: uri

Example Execution Logs
Generating lip sync video...
Generated video in 634.3sec
Downloading 6287351 bytes
Downloaded 6.00MB in 1.45sec
Output video duration: 9.97
Version Details
Version ID
0c6c10ac439d5f75fc63957af9984cf43505e4a4117aa53b605b509d54837213
Version Created
September 26, 2025
Run on Replicate →