Sync React 1 | AI Digital Human API

sync/react-1 (Audio-to-Video Lip Sync & Facial Animation)

sync/react-1 is a production-ready audio-driven video animation model that syncs a subject in a video to an input audio track. It supports selectable emotion control and multiple animation modes (lips / face / head) to help you generate natural, expressive results for short clips with minimal setup.

Why it stands out

Audio-to-video sync for fast talking-head and reaction style outputs.
Emotion presets (happy / sad / angry / disgusted / surprised / neutral) to steer overall expression.
Multiple control modes so you can animate only lips, or drive broader face/head motion.
Simple workflow and predictable pricing for quick iteration.

Capabilities

Audio-driven lip sync for an input video
Emotion-conditioned expression steering
Mode control for different animation scopes:
lips: focus on mouth movement
face: include facial expression changes
head: include head motion cues (where supported by the model)

Parameters

Parameter	Description
video*	Input video file or public URL.
audio*	Input audio file or public URL.
emotion	Expression preset: happy / sad / angry / disgusted / surprised / neutral.
model_mode	Animation scope: lips / face / head.

Pricing

Video Duration (s)	Total Price
1	$0.167
2	$0.334
3	$0.501
4	$0.668
5	$0.835

How to use

Upload the video (best with a clear, front-facing subject).
Upload the audio (speech, voiceover, or short dialogue).
Choose emotion to steer expression tone.
Choose model_mode (lips / face / head).
Run the model and download the synced result.

Best Use Cases

Short talking-head clips for creators and social media
Dubbing and voiceover sync for character shots
Expressive reaction clips with controlled emotion
Rapid prototyping for dialogue-driven video concepts

Notes

Best results: single subject, stable lighting, minimal motion blur, and a visible face.
Use lips mode for the most conservative edits; use face/head when you want stronger performance and expression.
Very long videos are billed at the 5-second cap, so trim to the segment you want to animate.

More Digital Human Models

wavespeed-ai/infinitetalk — Create realistic talking-head digital humans from a single portrait and audio, delivering stable lip sync and natural facial motion for voice-driven avatar videos.
wavespeed-ai/infinitetalk/multi — Multi-person talking avatar generation that syncs multiple faces to audio with consistent expressions and timing, ideal for dialogues, interviews, and group scenes.
kwaivgi/kling-v2-ai-avatar-pro — Pro-grade AI avatar video generation for high-fidelity digital humans with strong identity consistency and polished, production-ready results for marketing and creator content.

React 1 API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/sync/react-1 with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for React 1 below.

HTTP example

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/sync/react-1" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "video": "https://example.com/your-input.mp4",
    "audio": "https://example.com/your-audio.mp3",
    "emotion": "neutral",
    "model_mode": "face"
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("sync/react-1", {
        "video": "https://example.com/your-input.mp4",
        "audio": "https://example.com/your-audio.mp3",
        "emotion": "neutral",
        "model_mode": "face"
});

console.log(result.outputs[0]); // → URL of the generated output

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "sync/react-1",
    {
    "video": "https://example.com/your-input.mp4",
    "audio": "https://example.com/your-audio.mp3",
    "emotion": "neutral",
    "model_mode": "face"
}
)

print(output["outputs"][0])  # → URL of the generated output

React 1 API — Frequently asked questions

What is the React 1 API?

React 1 is a Sync model for talking-avatar generation, exposed as a REST API on WaveSpeedAI. Sync React-1 is a production-grade video-to-video lip-sync model. It maps any speech track to a target face, producing phoneme-accurate visemes and smooth timing while preserving identity, head pose, lighting, and background. Supports emotion and intensity control, multilingual speech, and long takes for talking-head content. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing. You can call it programmatically or try it from the playground above.

How do I call the React 1 API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/sync/sync-react-1.

How much does React 1 cost per run?

React 1 starts at $0.83 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does React 1 accept?

Key inputs: `video`, `audio`, `emotion`, `model_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/sync/sync-react-1.

How long does React 1 take to generate?

Average end-to-end generation time on WaveSpeedAI is around 379 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use React 1 outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Sync). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.

示例查看全部

相关模型

README