Kwaivgi Kling V1 Ai Avatar Pro

Playground

Kling AI Avatar Pro converts audio into talking video portraits; pricing is $1 for the first 5s then $0.20/s up to 600s. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Features

Kuaivgi Kling v1 AI Avatar Pro — Audio-Driven Talking Portrait

kling-v1-ai-avatar-pro turns a single portrait into a realistic talking-head video driven by your audio. It produces clean lip-sync, natural eye blinks, subtle head motion, and expressive timing suitable for ads, product explainers, education, and virtual hosts.

Highlights

High-fidelity lip-sync aligned to phonemes and pauses
Natural micro-expressions, eye blinks, and head motion for lifelike delivery
Works from one image; preserves identity and lighting
Optional style guidance via prompt for framing, vibe, and pacing
Built for production: stable outputs from licensed training data

Parameters

audio (required): speech or voice track. The model derives duration from the audio.
image (required): a clear, front-facing portrait (URL or upload).
prompt (optional): short guidance for style, mood, camera framing, or background.

Recommended inputs

Photo: frontal face, even lighting, no heavy occlusions; 512 px or larger
Audio: clean speech, 16–48 kHz, minimal music or reverb

How to Use

Upload or paste the audio URL.
Upload or paste the portrait image URL.
(Optional) Add a short prompt describing style or background tone.
Press Run and download the generated avatar video.

Tips

Trim long silences at the head and tail of the audio for snappier timing and lower cost.
For business use, prepare a neutral background and consistent headroom across images.
Use high-quality microphones or TTS to avoid muffled consonants.

Pricing

Price per second: $0.20

Billing rules

Minimum charge: 5 seconds.
Exact-length billing: after the 5-second minimum, price = audio duration (in seconds) × $0.20, up to the cap.
Maximum billable length: 600 seconds (10 minutes) → $120.00 cap.
Currency rounding: totals are rounded to the nearest cent.

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1-ai-avatar-pro" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
image	string	Yes		-	The image for generating the output.
audio	string	Yes	-	-	The audio for generating the output.
prompt	string	No		-	The positive prompt for the generation.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Kwaivgi Kling Text To Audio Kwaivgi Kling V1 AI Avatar Standard