50% zniżki na modele Vidu Q3 i Q3 Pro · Tylko w WaveSpeedAI | 20 maja – 2 czerwca

Kling V2 AI Avatar Pro

kwaivgi /

Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

digital-human
Wejście

Przeciągnij i upuść lub kliknij, aby przesłać

preview

Przeciągnij i upuść lub kliknij, aby przesłać

Bezczynny

$0.56za uruchomienie·~17 / $10

PrzykładyZobacz wszystkie

Powiązane modele

README

Kling-v2-ai-avatar-pro — Talking Avatar from Image + Audio

kling-v2-ai-avatar-pro turns a single portrait into a lip-synced talking-head video driven by your own audio. Upload a clear face image, provide a narration or dialogue track, and the model generates a vertical HD avatar clip that speaks and moves naturally on camera.

🌟 Highlights

  • Audio-driven performance – Uses your uploaded audio as-is (no TTS), keeping timing, pauses and emotion.
  • Photo-real talking avatar – Animates the face, eyes and head while preserving the identity from the reference image.
  • One-shot setup – Just an image + audio; no need for video capture or motion recording.
  • Portrait-ready output – Produces social-ready vertical video that fits Reels, TikTok, Shorts and story formats.
  • Prompt-guided styling (optional) – Use prompt to hint at camera feel or mood (e.g. “soft studio lighting, subtle head movement, gentle smile”).

🔧 Parameters

  • audio* – Required. The voice track that drives lip-sync and timing (URL or upload).
  • image* – Required. A clear, front-facing portrait of the person to animate.
  • prompt – Optional text describing style, expression or camera feel. If omitted, the model uses a neutral talking-head style.

Tip: Use a well-lit, unobstructed face (no heavy motion blur, minimal occlusion) for best identity preservation.

🚀 How to Use

  1. Upload audio
  • Clean mono/stereo track, with minimal background noise.
  • Make sure the final edited length matches what you want in the video.
  1. Upload image
  • Front or 3/4 view, eyes visible, face not cropped.
  • The avatar’s identity and pose come from this image.
  1. (Optional) Add a prompt
  • Guide expression or style, e.g.:

  • “confident presenter in a tech promo, subtle head nods”

  • “friendly customer service tone, warm expression”

  1. Run the model
  • The video length is automatically derived from the audio duration.
  • Download the generated talking-head clip and drop it into your editor or directly onto social platforms.

💰 Pricing

Billing is based on audio duration, with a minimum of 5 seconds.

Audio length (s)Billed secondsPrice (USD)
0–550.56
10101.12
20202.24
30303.36
60606.72

Any clip shorter than 5 seconds is still billed as 5 seconds.

🧠 Tips for Best Results

  • Edit your audio first – Remove mistakes, long silences and background noise before upload.
  • Match tone to use case – Calm, even delivery for corporate avatars; more expressive reads for ads or UGC.
  • Keep framing consistent – Use images with similar head size and framing across a campaign for a unified look.
  • Test a few portraits – Small changes in the reference image (lighting, angle) can noticeably change the avatar’s feel.

More Avatar Tools

See our Avatar Tools collection here!

  • infinitetalk – WaveSpeedAI’s Infinitetalk generates lip-synced talking-head avatar videos from your scripts or audio, ideal for virtual presenters and explainer content.

  • Infinitetalk-muti – WaveSpeedAI’s Infinitetalk-Multi extends the avatar pipeline to multi-speaker / multi-segment scenarios, making it easier to script dialogues, panel shots, or batch avatar content.

  • Omni-Human – ’s Omni-Human 1.5 creates high-fidelity digital humans from images and audio, suitable for realistic virtual hosts, brand ambassadors, and training avatars.

Dostępność:Ta strona korzysta z modeli AI udostępnianych przez podmioty trzecie.

Kling v2 Ai Avatar Pro API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/kwaivgi/kling-v2-ai-avatar-pro with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Kling v2 Ai Avatar Pro below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v2-ai-avatar-pro" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "image": "https://example.com/your-input.jpg",
    "audio": "https://example.com/your-audio.mp3",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("kwaivgi/kling-v2-ai-avatar-pro", {
        "image": "https://example.com/your-input.jpg",
        "audio": "https://example.com/your-audio.mp3",
        "prompt": "A cinematic shot of a city at sunset, soft golden light"
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "kwaivgi/kling-v2-ai-avatar-pro",
    {
    "image": "https://example.com/your-input.jpg",
    "audio": "https://example.com/your-audio.mp3",
    "prompt": "A cinematic shot of a city at sunset, soft golden light"
}
)

print(output["outputs"][0])  # → URL of the generated output

Kling v2 Ai Avatar Pro API — Frequently asked questions

What is the Kling v2 Ai Avatar Pro API?

Kling v2 Ai Avatar Pro is a Kuaishou model for talking-avatar generation, exposed as a REST API on WaveSpeedAI. Kling V2 AI Avatar Pro generates high-quality AI avatar videos with clean detail, stable motion, and strong identity consistency—ideal for profiles, intros, and social content. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Kling v2 Ai Avatar Pro API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v2-ai-avatar-pro.

How much does Kling v2 Ai Avatar Pro cost per run?

Kling v2 Ai Avatar Pro starts at $0.56 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Kling v2 Ai Avatar Pro accept?

Key inputs: `prompt`, `image`, `audio`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-v2-ai-avatar-pro.

How long does Kling v2 Ai Avatar Pro take to generate?

Average end-to-end generation time on WaveSpeedAI is around 325 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Kling v2 Ai Avatar Pro outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.