
digital-human
Idle
Votre requête coûtera $0.15 par exécution.
Pour $10 vous pouvez exécuter ce modèle environ 66 fois.
InfiniteTalk produces videos with precise lip sync, aligning the head, face, and body movements to the audio. It maintains identity across unlimited-length videos and also offers image-to-video generation, turning static photos into lively speaking or singing videos.
Accurate lip synchronization: aligns lip motion precisely with audio, preserving natural rhythm and pronunciation.
Full-body coherence: captures head movements, facial expressions, and posture changes beyond the lips.
Identity preservation: maintains consistent facial identity and visual style across frames.
Image-to-video capability: turns static photos into realistic speaking or singing videos.
Instruction following: accepts text prompts to control scene, pose, or behavior while syncing to audio.
| Output Resolution | Cost per 5 seconds | Max Length |
|---|---|---|
| 480p | $0.15 | 10 minutes |
| 720p | $0.30 | 10 minutes |
Do not upload the full image as mask_image. The mask should only cover the regions you want to animate—otherwise the result may render as fully black.