
image-to-video
Idle
Your request will cost $0.075 per run.
For $1 you can run this model approximately 13 times.
One more thing::
infinitetalk-fast produces videos with precise lip sync, aligning the head, face, and body movements to the audio. It maintains identity across unlimited-length videos and also offers image-to-video generation, turning static photos into lively speaking or singing videos.
Accurate lip synchronization: aligns lip motion precisely with audio, preserving natural rhythm and pronunciation.
Full-body coherence: captures head movements, facial expressions, and posture changes beyond the lips.
Identity preservation: maintains consistent facial identity and visual style across frames.
Image-to-video capability: turns static photos into realistic speaking or singing videos.
Instruction following: accepts text prompts to control scene, pose, or behavior while syncing to audio.
| Metric | Value |
|---|---|
| Price per second | $0.015 |
| Minimum billed duration | 5 s |
| Minimum total price | $0.075 |
| Maximum billed duration | 600 s |
| Maximum total price per run | $9.000 |