Kuaivgi Kling v1 AI Avatar Standard — Audio-Driven Talking Portrait
Turn a single portrait into a natural talking-head video driven by your audio. The Standard tier focuses on clean lip-sync and stable identity at a budget-friendly rate—great for explainers, support avatars, internal training, and product demos.
Highlights
- Phoneme-aligned lip-sync with natural eye blinks and head motion
- Identity-preserving generation from one image
- Works with real recordings or TTS audio
- Optional prompt to nudge framing, background vibe, or style
- Fast, reliable outputs suitable for everyday production
Parameters
- audio (required): speech track; duration determines the clip length
- image (required): clear, front-facing portrait (URL or upload)
- prompt (optional): short guidance for mood, background, or framing
Recommended inputs
- Portrait: even lighting, minimal occlusion, 512 px or larger
- Audio: clean voice, 16–48 kHz, avoid heavy music/reverb
How to Use
- Upload or paste the audio URL.
- Upload or paste the portrait image URL.
- (Optional) Add a brief prompt to describe background tone or framing.
- Press Run and download the generated avatar video.
Tips
- Trim long silences to reduce cost and tighten pacing.
- Keep headroom consistent across images if you plan a series.
- Use a high-quality mic or TTS for crisp consonants and better lip-sync.
Pricing
Price per second: $0.05
Billing rules
- Minimum charge: 5 seconds.
- Maximum billable length: 600 seconds (10 minutes) → $30.00 cap.
- Currency rounding: totals are rounded to the nearest cent.