
video-to-video
Idle
Your request will cost $0.835 per run.
For $10 you can run this model approximately 11 times.
One more thing::
sync/react-1 is a production-ready audio-driven video animation model that syncs a subject in a video to an input audio track. It supports selectable emotion control and multiple animation modes (lips / face / head) to help you generate natural, expressive results for short clips with minimal setup.
Audio-driven lip sync for an input video
Emotion-conditioned expression steering
Mode control for different animation scopes:
| Parameter | Description |
|---|---|
| video* | Input video file or public URL. |
| audio* | Input audio file or public URL. |
| emotion | Expression preset: happy / sad / angry / disgusted / surprised / neutral. |
| model_mode | Animation scope: lips / face / head. |
| Video Duration (s) | Total Price |
|---|---|
| 1 | $0.167 |
| 2 | $0.334 |
| 3 | $0.501 |
| 4 | $0.668 |
| 5 | $0.835 |
wavespeed-ai/infinitetalk — Create realistic talking-head digital humans from a single portrait and audio, delivering stable lip sync and natural facial motion for voice-driven avatar videos.
wavespeed-ai/infinitetalk/multi — Multi-person talking avatar generation that syncs multiple faces to audio with consistent expressions and timing, ideal for dialogues, interviews, and group scenes.
kwaivgi/kling-v2-ai-avatar-pro — Pro-grade AI avatar video generation for high-fidelity digital humans with strong identity consistency and polished, production-ready results for marketing and creator content.