
digital-human
This request costs $0.025 per run.
With $1, you can run this model about 40 times.
sync/lipsync-1.9.0-beta takes an existing video and a separate audio track, then reanimates the speaker’s mouth so the lips match the new speech. It’s a zero-shot lipsync model from Sync Labs—no training or cloning step required.
video* Required. Input video to be edited (URL or upload). Use a shot with a clearly visible face for best results.
audio* Required. Target speech track (URL or upload, e.g. MP3/WAV). The model will align lip movements to this audio.
sync_mode Controls behavior when the video and audio durations differ. Choose how the shorter stream is treated: looped, trimmed, padded with silence, or time-remapped.
Output: a new video where the speaker’s lips follow the uploaded audio.
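As a rough illustration of how the inputs above fit together, the following sketch assembles a request body from the three documented fields. The helper name `build_lipsync_payload` and the example URLs are hypothetical. The endpoint, authentication, and accepted `sync_mode` values are not given here, so the sketch only builds and prints the payload rather than sending it.

```python
import json


def build_lipsync_payload(video, audio, sync_mode=None):
    """Assemble a request body from the documented fields.

    Hypothetical helper: only the field names (video, audio, sync_mode)
    come from the parameter list; everything else is an assumption.
    """
    # Both video and audio are marked as required.
    if not video:
        raise ValueError("video is required")
    if not audio:
        raise ValueError("audio is required")

    payload = {"video": video, "audio": audio}
    # sync_mode is optional; include it only when the caller sets it.
    if sync_mode is not None:
        payload["sync_mode"] = sync_mode
    return payload


if __name__ == "__main__":
    # Placeholder URLs for illustration only.
    body = build_lipsync_payload(
        video="https://example.com/speaker.mp4",
        audio="https://example.com/new_speech.wav",
    )
    print(json.dumps(body, indent=2))
```

Leaving `sync_mode` out of the payload when it is unset lets the service apply its own default, which seems safer than guessing a value.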
Rate: $0.025 per second of processed video.
| Clip length (s) | Price (USD) |
|---|---|
| 5 | $0.13 |
| 10 | $0.25 |
| 20 | $0.50 |
| 30 | $0.75 |
| 60 | $1.50 |
You will only be charged for the actual duration of the input video after upload.
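The table values follow directly from the per-second rate. Here is a minimal sketch of the arithmetic, assuming the charge is rounded to the nearest cent with halves rounded up (an assumption that matches the $0.13 shown for a 5-second clip, where the exact product is $0.125). The function name `estimate_price` is illustrative.

```python
from decimal import Decimal, ROUND_HALF_UP

# Rate stated above: $0.025 per second of processed video.
RATE_PER_SECOND = Decimal("0.025")


def estimate_price(duration_seconds):
    """Estimate the charge in USD for a clip of the given length.

    Rounding half-up to cents is an assumption inferred from the table.
    """
    if duration_seconds < 0:
        raise ValueError("duration must be non-negative")
    cost = Decimal(str(duration_seconds)) * RATE_PER_SECOND
    return cost.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for seconds in (5, 10, 20, 30, 60):
        print(f"{seconds:>3} s -> ${estimate_price(seconds)}")
```

`Decimal` is used instead of floats so that values such as 0.125 round predictably to cents.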
WaveSpeedAI / InfiniteTalk: WaveSpeedAI’s single-avatar talking-head model that turns one photo plus audio into smooth, lip-synced digital presenter videos for tutorials, marketing, and social content.
WaveSpeedAI / InfiniteTalk Multi: Multi-avatar version of InfiniteTalk that drives several characters in one scene from separate audio tracks, ideal for dialog-style explainers, interviews, and role-play videos.
Kwaivgi / Kling V2 AI Avatar Standard: Cost-effective Kling-based AI avatar model that generates natural talking-face videos from a single reference image and voice track, suitable for everyday content and customer support.
Kwaivgi / Kling V2 AI Avatar Pro: Higher-fidelity Kling V2 avatar model for premium digital humans, offering smoother motion, better lip-sync, and more stable faces for commercials, brand spokespeople, and product demos.