lucataco/talking-avatar 🖼️✓❓ → 🖼️

▶️ 446 runs 📅 Aug 2025 ⚙️ Cog 0.14.0
audio-to-video autocaption image-to-video lip-sync talking-avatar video-generation

About

A pipeline that combines Seedance (image-to-video), Kling Lipsync (lip synchronization), and Autocaption (captions) to create talking avatars from an image and an audio file.

Example Output

Output

Performance Metrics

444.91s Prediction Time
445.28s Total Time
All Input Parameters
{
  "audio": "https://replicate.delivery/pbxt/NWYEr8QPkBeQZaJ2GkPlU1z1B7CWQJtyUJRoUnpwlluZaZ6g/replicate-prediction-gz0dq0zj71rma0crky99mj4q0r.mp3",
  "image": "https://replicate.delivery/pbxt/NWYErO2DeKhMsKFRuV3QKR8lfNh8M4uu69kX0NxC4H3jc9lx/jennai.jpg",
  "captions": true,
  "duration": 10,
  "resolution": "720p"
}
Input Parameters
audio (required) Type: string
Audio file for lip synchronization (.mp3, .wav, .m4a, or .aac)
image (required) Type: string
Input image of a person for the talking avatar
captions Type: boolean Default: true
Return video with captions
duration Default: 10
Duration of the video in seconds
resolution Default: 720p
Video resolution
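
As a minimal sketch, the parameters above could be assembled into a request body as follows. The helper names `build_input` and `build_request` are hypothetical. The endpoint and `Bearer` header are assumed from Replicate's general HTTP API for creating predictions, which this page does not document. The request is only constructed here, never sent.

```python
import json
import urllib.request
from urllib.parse import urlparse

# Version ID as listed under "Version Details" on this page.
VERSION_ID = "4071ea9349624cebaf00779c9b1050638e303e147435b25b144ed39dcaeba67a"

# Audio formats the page lists as accepted.
AUDIO_EXTENSIONS = (".mp3", ".wav", ".m4a", ".aac")


def build_input(audio, image, captions=True, duration=10, resolution="720p"):
    """Assemble the model input; defaults mirror the documented ones."""
    # Check the extension on the URL path only, ignoring any query string.
    if not urlparse(audio).path.lower().endswith(AUDIO_EXTENSIONS):
        raise ValueError(f"audio must end in one of {AUDIO_EXTENSIONS}")
    if not image:
        raise ValueError("image is required")
    return {
        "audio": audio,
        "image": image,
        "captions": captions,
        "duration": duration,
        "resolution": resolution,
    }


def build_request(model_input, api_token):
    """Build (but do not send) a prediction request.

    Assumption: the endpoint and header format follow Replicate's
    general HTTP API; they are not stated on this page.
    """
    body = json.dumps({"version": VERSION_ID, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        "https://api.replicate.com/v1/predictions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Only `audio` and `image` need to be supplied; the remaining arguments fall back to the defaults listed above. Passing the resulting request to `urllib.request.urlopen` would submit it, given a valid API token.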
Output Schema

Output

Type: string Format: uri
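
Because the output is a single URI string, a caller would typically validate it and then download the file. The sketch below shows one way to do that with the Python standard library. The helper names and the `output.mp4` fallback filename are assumptions for illustration; this page does not state the output file's container format.

```python
import posixpath
import urllib.request
from urllib.parse import urlparse


def output_filename(uri, default="output.mp4"):
    """Derive a local filename from the output URI.

    The default name is an assumption, used only when the URI path
    has no final component.
    """
    parsed = urlparse(uri)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URI: {uri!r}")
    return posixpath.basename(parsed.path) or default


def download_output(uri, dest=None):
    """Download the generated video to dest and return the local path."""
    dest = dest or output_filename(uri)
    urllib.request.urlretrieve(uri, dest)
    return dest
```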

Example Execution Logs
Generating video from image...
Adding lip synchronization...
Adding captions...
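
The three log lines above suggest that the stages run in sequence. A minimal sketch of that ordering follows, under the assumption that the caption stage is skipped when `captions` is false (the page only shows a run with `captions` set to true). The function name `expected_stages` is hypothetical.

```python
def expected_stages(captions=True):
    """Return the log lines a run would be expected to emit, in order.

    Assumption: the caption stage runs only when captions is true.
    """
    stages = [
        "Generating video from image...",
        "Adding lip synchronization...",
    ]
    if captions:
        stages.append("Adding captions...")
    return stages
```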
Version Details
Version ID
4071ea9349624cebaf00779c9b1050638e303e147435b25b144ed39dcaeba67a
Version Created
August 12, 2025