
image-to-video
Idle
Your request will cost $0.15 per run.
For $10 you can run this model approximately 66 times.
One more thing:
InfiniteTalk creates videos with accurate lip sync while matching head, face, and body movements to the audio. It maintains identity across unlimited-length videos and also allows image-to-video creation, transforming static photos into lively speaking or singing videos.
Accurate lip synchronization: aligns lip motion precisely with audio, preserving natural rhythm and pronunciation.
Full-body coherence: captures head movements, facial expressions, and posture changes beyond the lips.
Identity preservation: maintains consistent facial identity and visual style across frames.
Image-to-video capability: turns static photos into realistic speaking or singing videos.
Instruction following: accepts text prompts to control scene, pose, or behavior while syncing to audio.
| Output Resolution | Cost per 5 seconds | Max Length |
|---|---|---|
| 480p | $0.15 | 10 minutes |
| 720p | $0.30 | 10 minutes |
Max clip length per job: up to 10 minutes
Processing speed: approximately 10–30 seconds of wall time per 1 second of video (varies by resolution and queue load)