lucataco/talking-avatar 🖼️✓❓ → 🖼️

▶️ 789 runs 📅 Aug 2025 ⚙️ Cog 0.14.0

audio-to-video autocaption image-to-video lip-sync talking-avatar video-generation

Performance

444.9sTypical run time

789Total runs

About

A combination of Seedance, Kling Lipsync and Autocaption to create talking avatars

Example Output

Output

Performance Metrics

444.91s Prediction Time

445.28s Total Time

All Input Parameters

{
  "audio": "https://replicate.delivery/pbxt/NWYEr8QPkBeQZaJ2GkPlU1z1B7CWQJtyUJRoUnpwlluZaZ6g/replicate-prediction-gz0dq0zj71rma0crky99mj4q0r.mp3",
  "image": "https://replicate.delivery/pbxt/NWYErO2DeKhMsKFRuV3QKR8lfNh8M4uu69kX0NxC4H3jc9lx/jennai.jpg",
  "captions": true,
  "duration": 10,
  "resolution": "720p"
}

Input Parameters

audio (required) Type: string: Audio file for lip synchronization (.mp3, .wav, .m4a, or .aac)
image (required) Type: string: Input image of a person for the talking avatar
captions Type: booleanDefault: true: Return video with captions
duration Default: 10: duration of the video in seconds
resolution Default: 720p: Video resolution

Output Schema

Output

Type: string • Format: uri

Example Execution Logs

Generating video from image...
Adding lip synchronization...
Adding captions...

Version Details

Version ID: 4071ea9349624cebaf00779c9b1050638e303e147435b25b144ed39dcaeba67a
Version Created: August 12, 2025

Run on Replicate →