WaveSpeedAI APIKwaivgiKwaivgi Kling V1 AI Avatar Pro

Kwaivgi Kling V1 Ai Avatar Pro

Kwaivgi Kling V1 Ai Avatar Pro

Playground

Try it on WavespeedAI!

Kling AI Avatar Pro converts audio into talking video portraits; pricing is $1 for the first 5s then $0.20/s up to 600s. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Kuaivgi Kling v1 AI Avatar Pro — Audio-Driven Talking Portrait

kling-v1-ai-avatar-pro turns a single portrait into a realistic talking-head video driven by your audio. It produces clean lip-sync, natural eye blinks, subtle head motion, and expressive timing suitable for ads, product explainers, education, and virtual hosts.


Highlights

  • High-fidelity lip-sync aligned to phonemes and pauses
  • Natural micro-expressions, eye blinks, and head motion for lifelike delivery
  • Works from one image; preserves identity and lighting
  • Optional style guidance via prompt for framing, vibe, and pacing
  • Built for production: stable outputs from licensed training data

Parameters

  • audio (required): speech or voice track. The model derives duration from the audio.
  • image (required): a clear, front-facing portrait (URL or upload).
  • prompt (optional): short guidance for style, mood, camera framing, or background.
  • Photo: frontal face, even lighting, no heavy occlusions; 512 px or larger
  • Audio: clean speech, 16–48 kHz, minimal music or reverb

How to Use

  1. Upload or paste the audio URL.
  2. Upload or paste the portrait image URL.
  3. (Optional) Add a short prompt describing style or background tone.
  4. Press Run and download the generated avatar video.

Tips

  • Trim long silences at the head and tail of the audio for snappier timing and lower cost.
  • For business use, prepare a neutral background and consistent headroom across images.
  • Use high-quality microphones or TTS to avoid muffled consonants.

Pricing

  • Price per second: $0.20

Billing rules

  1. Minimum charge: 5 seconds.
  2. Exact-length billing: after the 5-second minimum, price = audio duration (in seconds) × $0.20, up to the cap.
  3. Maximum billable length: 600 seconds (10 minutes) → $120.00 cap.
  4. Currency rounding: totals are rounded to the nearest cent.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-ai-avatar-pro" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
audiostringYes--The audio for generating the output.
imagestringYes-The image for generating the output.
promptstringNo-The positive prompt for the generation.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

ParameterTypeRequiredDefaultDescription
idstringYes-Task ID

Result Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
dataobjectThe prediction data object containing all details
data.idstringUnique identifier for the prediction, the ID of the prediction to get
data.modelstringModel ID used for the prediction
data.outputsstringArray of URLs to the generated content (empty when status is not completed).
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds
© 2025 WaveSpeedAI. All rights reserved.