Kling V1 AI Avatar Pro | AI Digital Human API

Kwaivgi Kling v1 AI Avatar Pro: Audio-Driven Talking Portrait

kling-v1-ai-avatar-pro turns a single portrait into a realistic talking-head video driven by your audio. It produces clean lip-sync, natural eye blinks, subtle head motion, and expressive timing suitable for ads, product explainers, education, and virtual hosts.

Highlights

High-fidelity lip-sync aligned to phonemes and pauses
Natural micro-expressions, eye blinks, and head motion for lifelike delivery
Works from one image; preserves identity and lighting
Optional style guidance via prompt for framing, vibe, and pacing
Built for production: stable outputs from licensed training data

Parameters

audio (required): speech or voice track. The model derives duration from the audio.
image (required): a clear, front-facing portrait (URL or upload).
prompt (optional): short guidance for style, mood, camera framing, or background.
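As a rough illustration of the parameter rules above, the sketch below assembles a request body and rejects a missing required field. The helper name build_payload is hypothetical and not part of any WaveSpeedAI SDK; only the three keys (audio, image, prompt) come from the parameter list.

```python
import json


def build_payload(audio, image, prompt=None):
    """Assemble the JSON body: audio and image are required, prompt is optional."""
    if not audio:
        raise ValueError("audio is required")
    if not image:
        raise ValueError("image is required")
    payload = {"audio": audio, "image": image}
    if prompt:  # leave the key out entirely when no guidance is given
        payload["prompt"] = prompt
    return payload


body = build_payload(
    audio="https://example.com/your-audio.mp3",
    image="https://example.com/your-input.jpg",
)
print(json.dumps(body, indent=2))
```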

Recommended inputs

Photo: frontal face, even lighting, no heavy occlusions; 512 px or larger
Audio: clean speech, 16–48 kHz, minimal music or reverb
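A pre-flight check along these lines can catch inputs that fall outside the recommendations before you pay for a run. This is a minimal sketch: check_recommended is a hypothetical helper, and treating "512 px or larger" as a limit on the shortest image side is an assumption, since the recommendation does not say which dimension it means.

```python
MIN_SIDE_PX = 512             # from "512 px or larger"
MIN_RATE_HZ = 16_000          # from "16–48 kHz"
MAX_RATE_HZ = 48_000


def check_recommended(shortest_side_px, sample_rate_hz):
    """Return a list of warnings; an empty list means the inputs look fine."""
    warnings = []
    if shortest_side_px < MIN_SIDE_PX:  # assumption: applies to the shortest side
        warnings.append(f"image side {shortest_side_px}px is below {MIN_SIDE_PX}px")
    if not MIN_RATE_HZ <= sample_rate_hz <= MAX_RATE_HZ:
        warnings.append(f"sample rate {sample_rate_hz} Hz is outside 16-48 kHz")
    return warnings


for warning in check_recommended(shortest_side_px=400, sample_rate_hz=8_000):
    print("warning:", warning)
```

The image size and sample rate would come from whatever image and audio tooling you already use; the check itself only compares numbers.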

How to Use

Upload the audio file or paste its URL.
Upload the portrait image or paste its URL.
(Optional) Add a short prompt describing style or background tone.
Press Run and download the generated avatar video.

Tips

Trim long silences at the head and tail of the audio for snappier timing and lower cost.
For business use, prepare a neutral background and consistent headroom across images.
Use high-quality microphones or TTS to avoid muffled consonants.
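The first tip, trimming silence at the head and tail, can be sketched on raw PCM samples as follows. This is an illustration under stated assumptions, not a production audio tool: trim_silence is a hypothetical helper, the samples are assumed to be signed 16-bit integers already decoded from the file, and the threshold of 500 is an arbitrary starting point that you would tune for your recordings.

```python
def trim_silence(samples, threshold=500, keep=0):
    """Drop leading and trailing samples whose amplitude stays at or below threshold.

    keep is the number of quiet samples to retain on each side as padding.
    """
    loud = [i for i, s in enumerate(samples) if abs(s) > threshold]
    if not loud:
        return []  # the whole clip is below the threshold
    start = max(loud[0] - keep, 0)
    end = min(loud[-1] + 1 + keep, len(samples))
    return samples[start:end]


print(trim_silence([0, 0, 10, 900, -800, 3, 0], threshold=500))  # → [900, -800]
```

Because the model derives duration from the audio, every second of silence removed this way is a second that is not billed, as long as the clip stays above the 5-second minimum.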

Pricing

Price per second: $0.20

Billing rules

Minimum charge: 5 seconds.
Exact-length billing: after the 5-second minimum, price = audio duration (in seconds) × $0.20, up to the cap.
Maximum billable length: 600 seconds (10 minutes) → $120.00 cap.
Currency rounding: totals are rounded to the nearest cent.
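The billing rules above reduce to a clamp followed by a multiplication. The sketch below works through them. estimate_cost is a hypothetical helper, not an official calculator. It makes two assumptions the rules do not spell out: audio longer than 600 seconds is billed at the cap, and ties round half up.

```python
from decimal import Decimal, ROUND_HALF_UP

PRICE_PER_SECOND = Decimal("0.20")
MIN_SECONDS = Decimal("5")
MAX_SECONDS = Decimal("600")


def estimate_cost(audio_seconds):
    """Estimate the charge in USD for a clip of the given length."""
    seconds = Decimal(str(audio_seconds))
    # Clamp to the billable range: at least 5 s, at most 600 s (assumed cap behavior).
    billable = min(max(seconds, MIN_SECONDS), MAX_SECONDS)
    total = billable * PRICE_PER_SECOND
    # Round to the nearest cent; half-up on ties is an assumption.
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


print(estimate_cost(3))     # → 1.00 (minimum charge)
print(estimate_cost(42.5))  # → 8.50
print(estimate_cost(900))   # → 120.00 (cap)
```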

Kling v1 AI Avatar Pro API: Quick start

Get a WaveSpeedAI API key, then send a POST request to https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-ai-avatar-pro with your input as a JSON body. The endpoint returns a prediction id; poll https://api.wavespeed.ai/api/v3/predictions/{request_id}/result until status is completed, then read the output URL from data.outputs[0]. Examples for Kling v1 AI Avatar Pro follow.

HTTP example

# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-ai-avatar-pro" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "image": "https://example.com/your-input.jpg",
    "audio": "https://example.com/your-audio.mp3",
    "prompt": "A friendly presenter in a bright studio, calm and steady delivery"
  }'

# The response includes a prediction id. Substitute it for {request_id} and poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].

Node.js example

// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

// CommonJS modules have no top-level await, so run the call inside an async function.
async function main() {
  const result = await client.run("kwaivgi/kling-v1-ai-avatar-pro", {
    image: "https://example.com/your-input.jpg",
    audio: "https://example.com/your-audio.mp3",
    prompt: "A friendly presenter in a bright studio, calm and steady delivery"
  });

  console.log(result.outputs[0]); // → URL of the generated output
}

main().catch((err) => {
  console.error(err);
  process.exit(1);
});

Python example

# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "kwaivgi/kling-v1-ai-avatar-pro",
    {
        "image": "https://example.com/your-input.jpg",
        "audio": "https://example.com/your-audio.mp3",
        "prompt": "A friendly presenter in a bright studio, calm and steady delivery",
    },
)

print(output["outputs"][0])  # → URL of the generated output
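If you call the REST endpoints directly instead of using the SDK, the polling step from the quick start might look like the sketch below, using only the standard library. Several details are assumptions rather than documented behavior: that status sits next to outputs under data, that a failed run reports the status "failed", and the 5-second interval and 30-minute timeout. The names fetch_result and wait_for_output are hypothetical.

```python
import json
import os
import time
import urllib.request

RESULT_URL = "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result"


def fetch_result(request_id):
    """GET the prediction result and return the parsed JSON body."""
    req = urllib.request.Request(
        RESULT_URL.format(request_id=request_id),
        headers={"Authorization": "Bearer " + os.environ["WAVESPEED_API_KEY"]},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_output(request_id, fetch=fetch_result, interval=5.0, timeout=1800.0):
    """Poll until status is "completed", then return data.outputs[0]."""
    deadline = time.monotonic() + timeout
    while True:
        data = fetch(request_id)["data"]  # assumed: status and outputs live under data
        if data["status"] == "completed":
            return data["outputs"][0]
        if data["status"] == "failed":  # assumed name of the failure status
            raise RuntimeError(f"prediction {request_id} failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction {request_id} not done after {timeout} s")
        time.sleep(interval)
```

The fetch argument exists so the loop can be exercised with a stub in tests; in normal use you would call wait_for_output with the id returned by the submit request.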

Kling v1 AI Avatar Pro API: Frequently asked questions

What is the Kling v1 AI Avatar Pro API?

Kling v1 AI Avatar Pro is a Kuaishou model for talking-avatar generation, exposed as a REST API on WaveSpeedAI. It converts an audio track and a single portrait into a talking-head video. Pricing is $0.20 per second with a 5-second minimum ($1.00), up to 600 seconds. The API is ready to use with no cold starts. You can call it programmatically or try it from the playground above.

How do I call the Kling v1 AI Avatar Pro API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v1-ai-avatar-pro.

How much does Kling v1 AI Avatar Pro cost per run?

Kling v1 AI Avatar Pro is billed at $0.20 per second of audio with a 5-second minimum, so the smallest charge is $1.00 per run and the cap is $120.00 at 600 seconds. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Kling v1 AI Avatar Pro accept?

Key inputs: `prompt`, `image`, `audio`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v1-ai-avatar-pro.

How long does Kling v1 AI Avatar Pro take to generate?

Average end-to-end generation time on WaveSpeedAI is around 639 seconds per request, measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Kling v1 AI Avatar Pro outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.
