
video-to-video
Lipsync 2.0 is a zero-shot lipsync model that takes an existing video and a separate audio track, then re-animates the mouth so lip movements match the speech. No training or fine-tuning is required, and it preserves the speaker’s style across languages, dubbing scenarios, and character types.
video (required): Source video to be re-dubbed (URL or upload). Use clips where the face is clearly visible and not heavily occluded.
audio (required): Target speech audio (URL or upload). The lips will be synced to this track.
sync_mode: Strategy for matching video and audio durations when they differ.
Output: a re-synced MP4 video with lips matching the provided audio.
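As a rough illustration of how the inputs above fit together, the sketch below assembles a request payload in Python. Only the field names `video`, `audio`, and `sync_mode` come from the parameter list; the helper name `build_lipsync_request`, the example URLs, and the validation rules are assumptions made for illustration. It does not call any endpoint, since the endpoint and authentication details are not given here.

```python
# Hypothetical helper: field names follow the parameter list above;
# the function name and checks are illustrative, not part of any official SDK.
def build_lipsync_request(video, audio, sync_mode=None):
    """Assemble an input payload for a lipsync run."""
    if not video:
        raise ValueError("video is required")
    if not audio:
        raise ValueError("audio is required")

    payload = {"video": video, "audio": audio}

    # sync_mode is optional. Its accepted values are not listed here,
    # so the value is passed through without validation.
    if sync_mode is not None:
        payload["sync_mode"] = sync_mode
    return payload


# Placeholder URLs, for illustration only.
request = build_lipsync_request(
    video="https://example.com/source.mp4",
    audio="https://example.com/dub.wav",
)
print(request)
```

Checking the two required inputs before submitting avoids paying for a run that would fail on a missing file.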
Pricing is linear in video length; the examples below work out to $0.05 per second:
| Video length | Price |
|---|---|
| 5 s | $0.25 |
| 10 s | $0.50 |
| 30 s | $1.50 |
| 60 s | $3.00 |
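The table above can be reproduced with a small estimator. This is a minimal sketch, assuming a flat rate of $0.05 per second inferred from the example rows (5 s costs $0.25). The name `estimate_price` is hypothetical, and actual billing may apply rounding or minimum charges that are not described here.

```python
from decimal import Decimal

# Assumption: rate inferred from the example table (5 s -> $0.25),
# not an officially documented constant.
PRICE_PER_SECOND = Decimal("0.05")


def estimate_price(video_seconds):
    """Return the estimated cost in USD for a clip of the given length."""
    if video_seconds < 0:
        raise ValueError("video length must be non-negative")
    # Convert via str so float inputs such as 7.5 do not carry binary noise.
    return Decimal(str(video_seconds)) * PRICE_PER_SECOND


for seconds in (5, 10, 30, 60):
    print(f"{seconds} s -> ${estimate_price(seconds):.2f}")
```

`Decimal` is used instead of floats so that the currency amounts stay exact when summed across many runs.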
WaveSpeedAI / InfiniteTalk: WaveSpeedAI’s single-avatar talking-head model that turns one photo plus audio into smooth, lip-synced digital presenter videos for tutorials, marketing, and social content.
WaveSpeedAI / InfiniteTalk Multi: Multi-avatar version of InfiniteTalk that drives several characters in one scene from separate audio tracks, ideal for dialog-style explainers, interviews, and role-play videos.
Kwaivgi / Kling V2 AI Avatar Standard: Cost-effective Kling-based AI avatar model that generates natural talking-face videos from a single reference image and voice track, suitable for everyday content and customer support.
Kwaivgi / Kling V2 AI Avatar Pro: Higher-fidelity Kling V2 avatar model for premium digital humans, offering smoother motion, better lip-sync, and more stable faces for commercials, brand spokespeople, and product demos.