digital-human

Kwaivgi Kling V1 AI Avatar Pro

kwaivgi/kling-v1-ai-avatar-pro

Kling AI Avatar Pro turns an audio track and a portrait image into a talking-head video. Pricing is a $1.00 minimum that covers the first 5 seconds, then $0.20 per second up to 600 seconds. It is available as a ready-to-use REST inference API with no cold starts.

Your request will cost $0.5 per run.

For $10 you can run this model approximately 20 times.

README

Kwaivgi Kling v1 AI Avatar Pro: Audio-Driven Talking Portrait

kling-v1-ai-avatar-pro turns a single portrait into a realistic talking-head video driven by your audio. It produces clean lip-sync, natural eye blinks, subtle head motion, and expressive timing suitable for ads, product explainers, education, and virtual hosts.

Highlights

  • High-fidelity lip-sync aligned to phonemes and pauses
  • Natural micro-expressions, eye blinks, and head motion for lifelike delivery
  • Works from one image; preserves identity and lighting
  • Optional style guidance via prompt for framing, vibe, and pacing
  • Built for production: stable outputs from licensed training data

Parameters

  • audio (required): speech or voice track. The model derives duration from the audio.
  • image (required): a clear, front-facing portrait (URL or upload).
  • prompt (optional): short guidance for style, mood, camera framing, or background.
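
As a rough illustration of the parameters above, the following sketch assembles a JSON request body. Only the field names `audio`, `image`, and `prompt` come from this page; the helper name `build_payload` is hypothetical, and the endpoint URL and authentication header are not given here, so they are left out.

```python
import json
from typing import Dict, Optional


def build_payload(audio: str, image: str, prompt: Optional[str] = None) -> Dict[str, str]:
    """Assemble a request body from the three documented parameters.

    Hypothetical helper: the field names come from the parameter list,
    everything else is an assumption made for illustration.
    """
    if not audio:
        raise ValueError("audio is required")
    if not image:
        raise ValueError("image is required")

    payload = {"audio": audio, "image": image}
    # prompt is optional, so it is only included when it has content.
    if prompt and prompt.strip():
        payload["prompt"] = prompt.strip()
    return payload


if __name__ == "__main__":
    body = build_payload(
        audio="https://example.com/voice.mp3",     # placeholder URL
        image="https://example.com/portrait.png",  # placeholder URL
        prompt="calm studio lighting, medium close-up",
    )
    print(json.dumps(body, indent=2))
```

Omitting `prompt` yields a body with only the two required fields.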

Recommended inputs

  • Photo: frontal face, even lighting, no heavy occlusions; 512 px or larger
  • Audio: clean speech, 16–48 kHz, minimal music or reverb
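
A small pre-flight check can catch inputs that fall outside these recommendations before a run is paid for. This is only a sketch: the function name `check_inputs` is hypothetical, and it assumes "512 px or larger" refers to the shorter side of the image, which the page does not specify.

```python
from typing import List

MIN_IMAGE_SIDE_PX = 512        # from "512 px or larger"
MIN_SAMPLE_RATE_HZ = 16_000    # from "16–48 kHz"
MAX_SAMPLE_RATE_HZ = 48_000


def check_inputs(image_width: int, image_height: int, sample_rate_hz: int) -> List[str]:
    """Return human-readable warnings; an empty list means no issues were found."""
    warnings = []
    # Assumption: the 512 px recommendation applies to the shorter side.
    if min(image_width, image_height) < MIN_IMAGE_SIDE_PX:
        warnings.append(
            f"image is {image_width}x{image_height}; "
            f"recommended shorter side is at least {MIN_IMAGE_SIDE_PX} px"
        )
    if not MIN_SAMPLE_RATE_HZ <= sample_rate_hz <= MAX_SAMPLE_RATE_HZ:
        warnings.append(
            f"audio sample rate is {sample_rate_hz} Hz; "
            f"recommended range is {MIN_SAMPLE_RATE_HZ}-{MAX_SAMPLE_RATE_HZ} Hz"
        )
    return warnings


if __name__ == "__main__":
    for message in check_inputs(400, 600, 8_000):
        print("warning:", message)
```

These are recommendations rather than documented hard limits, so the sketch warns instead of rejecting the input.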

How to Use

  1. Upload the audio file or paste its URL.
  2. Upload the portrait image or paste its URL.
  3. (Optional) Add a short prompt describing style or background tone.
  4. Press Run and download the generated avatar video.

Tips

  • Trim long silences at the head and tail of the audio for snappier timing and lower cost.
  • For business use, prepare a neutral background and consistent headroom across images.
  • Use high-quality microphones or TTS to avoid muffled consonants.
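
The first tip, trimming silence at the head and tail, can be sketched on raw samples as follows. This is a simplified illustration, not a production audio tool: `trim_silence` is a hypothetical helper, the samples are assumed to be floats normalized to [-1.0, 1.0], and the threshold of 0.01 is an arbitrary choice.

```python
from typing import List, Sequence


def trim_silence(samples: Sequence[float], threshold: float = 0.01) -> List[float]:
    """Drop leading and trailing samples whose amplitude is below threshold.

    Silence in the middle of the clip (natural pauses) is left untouched.
    """
    start = 0
    end = len(samples)
    # Advance past quiet samples at the head.
    while start < end and abs(samples[start]) < threshold:
        start += 1
    # Step back over quiet samples at the tail.
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return list(samples[start:end])


if __name__ == "__main__":
    clip = [0.0, 0.0, 0.5, 0.0, -0.3, 0.0, 0.0]
    print(trim_silence(clip))
```

In practice it may be worth keeping a short pad of silence on each side so the first and last words are not clipped.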

Pricing

  • Price per second: $0.20

Billing rules

  1. Minimum charge: 5 seconds ($1.00). Audio shorter than 5 seconds is billed as 5 seconds.
  2. Exact-length billing: for audio longer than 5 seconds, price = audio duration (in seconds) × $0.20, up to the cap.
  3. Maximum billable length: 600 seconds (10 minutes) → $120.00 cap.
  4. Currency rounding: totals are rounded to the nearest cent.
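
The billing rules above can be sketched as a small cost estimator. The function name `estimate_cost` is hypothetical, and two details are assumptions: ties are rounded half-up (the page only says "nearest cent"), and audio longer than 600 seconds is simply billed at the cap, since the page does not say whether such audio is rejected or truncated.

```python
from decimal import Decimal, ROUND_HALF_UP

PRICE_PER_SECOND = Decimal("0.20")
MIN_BILLABLE_SECONDS = Decimal("5")
MAX_BILLABLE_SECONDS = Decimal("600")


def estimate_cost(audio_seconds: float) -> Decimal:
    """Estimate the charge in USD for an audio clip of the given duration."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")

    duration = Decimal(str(audio_seconds))
    # Rule 1 (5-second minimum) and rule 3 (600-second cap).
    billable = min(max(duration, MIN_BILLABLE_SECONDS), MAX_BILLABLE_SECONDS)
    # Rule 2: exact-length billing at the per-second price.
    total = billable * PRICE_PER_SECOND
    # Rule 4: round to the nearest cent (half-up is an assumption).
    return total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for seconds in (3, 5, 12.34, 600, 900):
        print(f"{seconds:>7} s -> ${estimate_cost(seconds)}")
```

`Decimal` is used instead of floats so that cent rounding is not affected by binary floating-point error.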