lucataco/talking-avatar 🖼️✓❓ → 🖼️
About
A combination of Seedance, Kling Lipsync and Autocaption to create talking avatars
Example Output
Output
Performance Metrics
444.91s
Prediction Time
445.28s
Total Time
All Input Parameters
{
"audio": "https://replicate.delivery/pbxt/NWYEr8QPkBeQZaJ2GkPlU1z1B7CWQJtyUJRoUnpwlluZaZ6g/replicate-prediction-gz0dq0zj71rma0crky99mj4q0r.mp3",
"image": "https://replicate.delivery/pbxt/NWYErO2DeKhMsKFRuV3QKR8lfNh8M4uu69kX0NxC4H3jc9lx/jennai.jpg",
"captions": true,
"duration": 10,
"resolution": "720p"
}
Input Parameters
- audio (required)
- Audio file for lip synchronization (.mp3, .wav, .m4a, or .aac)
- image (required)
- Input image of a person for the talking avatar
- captions
- Return video with captions
- duration
- duration of the video in seconds
- resolution
- Video resolution
Output Schema
Output
Example Execution Logs
Generating video from image... Adding lip synchronization... Adding captions...
Version Details
- Version ID
4071ea9349624cebaf00779c9b1050638e303e147435b25b144ed39dcaeba67a- Version Created
- August 12, 2025