image-to-video
Idle
Your request will cost $0.15 per run.
For $10 you can run this model approximately 66 times.
One more thing:
InfiniteTalk produces videos with precise lip sync, aligning the head, face, and body movements to the audio. It maintains identity across unlimited-length videos and also offers image-to-video generation, turning static photos into lively speaking or singing videos.
Accurate lip synchronization: aligns lip motion precisely with audio, preserving natural rhythm and pronunciation.
Full-body coherence: captures head movements, facial expressions, and posture changes beyond the lips.
Identity preservation: maintains consistent facial identity and visual style across frames.
Image-to-video capability: turns static photos into realistic speaking or singing videos.
Instruction following: accepts text prompts to control scene, pose, or behavior while syncing to audio.
Output Resolution | Cost per 5 seconds | Max Length |
---|---|---|
480p | $0.15 | 10 minutes |
720p | $0.30 | 10 minutes |
Minimum charge: 5 seconds
Per-second rate = (price per 5 seconds) ÷ 5
Billed duration = video length in seconds (rounded up), with a 5-second minimum
Total cost = billed duration × per-second rate (by output resolution)