Kling v2.1 I2V Pro — kwaivgi/kling-v2.1-i2v-pro
Kling v2.1 I2V Pro turns a single reference image into a short, cinematic video clip guided by your prompt. Upload an image, describe the motion (subject + camera + environment), and the model animates the scene while keeping the input image as the visual anchor. Built for stable production use with a ready-to-use REST API, no cold starts, and predictable pricing.
Key capabilities
- Image-to-video generation anchored to your input image
- Prompt-controlled motion: facial micro-expressions, hair/clothing movement, environment effects
- Cinematic camera moves: push-in, orbit, pan, tilt, handheld feel
- Supports negative_prompt to reduce artifacts and unwanted styles
Pricing
| Duration | Price |
|---|---|
| 5s | $0.45 |
| 10s | $0.90 |
| 15s | $1.35 |
| 20s | $1.80 |
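The listed prices work out to $0.09 per second of video. For budgeting a batch of clips, a small lookup like the following may help. This is only a sketch: `PRICE_BY_DURATION` and `estimate_cost` are hypothetical names, and the figures are copied from the table above rather than fetched from any billing API.

```python
from decimal import Decimal

# Prices copied from the pricing table, keyed by duration in seconds.
PRICE_BY_DURATION = {
    5: Decimal("0.45"),
    10: Decimal("0.90"),
    15: Decimal("1.35"),
    20: Decimal("1.80"),
}


def estimate_cost(duration_s: int, clips: int = 1) -> Decimal:
    """Return the listed price for `clips` videos of `duration_s` seconds each."""
    if duration_s not in PRICE_BY_DURATION:
        raise ValueError(f"no listed price for a {duration_s}s clip")
    if clips < 0:
        raise ValueError("clips must be non-negative")
    return PRICE_BY_DURATION[duration_s] * clips


# Three 10-second clips at $0.90 each.
print(estimate_cost(10, clips=3))  # 2.70
```

`Decimal` is used instead of `float` so that totals stay exact to the cent.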
Inputs
- image (required): the reference image used as the visual anchor
- prompt (required): describe what moves and how the camera behaves
- negative_prompt (optional): describe what to avoid (blur, distortions, artifacts)
Parameters
- prompt: motion + scene direction for the clip
- negative_prompt: optional “avoid list”
- image: input image (upload or URL)
- guidance_scale: how strongly motion follows your prompt (lower = more natural, higher = more literal)
- duration: video length in seconds (the pricing table lists 5, 10, 15, and 20)
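As a rough illustration, the parameters above could be assembled into a JSON request body like this. The field names come from the list above, but the rest is assumed: `build_payload` is a hypothetical helper, the image URL is a placeholder, the default duration and the `guidance_scale` value of 0.5 are illustrative guesses, and the endpoint and authentication details are not covered in this document, so no HTTP call is made.

```python
import json

# Durations that appear in the pricing table; whether the API accepts
# other values is not stated in this document.
LISTED_DURATIONS = (5, 10, 15, 20)


def build_payload(image, prompt, negative_prompt=None, guidance_scale=None, duration=5):
    """Assemble a request body using the parameter names listed above."""
    if not image:
        raise ValueError("image is required")
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if duration not in LISTED_DURATIONS:
        raise ValueError(f"duration must be one of {LISTED_DURATIONS}")

    payload = {"image": image, "prompt": prompt.strip(), "duration": duration}
    # Optional fields are only sent when the caller sets them.
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    if guidance_scale is not None:
        payload["guidance_scale"] = guidance_scale
    return payload


payload = build_payload(
    image="https://example.com/portrait.jpg",  # placeholder URL
    prompt="Gentle breeze, subtle facial motion, camera slow push-in",
    negative_prompt="blur, jitter, watermark",
    guidance_scale=0.5,  # illustrative value; the valid range is an assumption
    duration=5,
)
print(json.dumps(payload, indent=2))
```

Validating the required fields and the duration locally catches mistakes before a paid request is sent.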
Prompting guide (I2V)
Write prompts like a director’s brief, focusing on motion:
- Subject motion: expression change, breathing, walking, turning, hair swaying
- Environment motion: wind, rain, fog, particles, light rays
- Camera motion: slow push-in, orbit, dolly, handheld micro-shake
- Continuity: keep identity, outfit, and scene layout consistent with the input image
Example prompts
- A cinematic close-up of a woman laughing on a sunny city street. Her hair sways in the wind, coat fabric subtly moves, warm natural light, shallow depth of field, camera slow push-in, smooth motion, 5 seconds.
- Portrait in golden hour. Gentle breeze, subtle facial motion, soft lens flare, handheld micro-sway, realistic skin texture, 5 seconds.
- Moody night street scene. Light rain, drifting mist, neon reflections, camera slow orbit around the subject, 5 seconds.
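When prompts are generated programmatically, the director's-brief structure from the prompting guide can be kept explicit by composing the prompt from its parts. The helper below is a hypothetical sketch (`compose_prompt` is not part of any API); it simply joins the scene description with subject, environment, camera, and continuity cues in that order.

```python
def compose_prompt(scene, subject=(), environment=(), camera=(), continuity=()):
    """Join a scene description and motion cues into one comma-separated prompt."""
    parts = [scene.strip().rstrip(".")]
    for group in (subject, environment, camera, continuity):
        parts.extend(item.strip() for item in group)
    # Drop empty fragments so stray commas never reach the prompt.
    parts = [p for p in parts if p]
    if not parts:
        raise ValueError("prompt needs at least one non-empty part")
    return ", ".join(parts) + "."


prompt = compose_prompt(
    "Moody night street scene",
    environment=["light rain", "drifting mist", "neon reflections"],
    camera=["camera slow orbit around the subject"],
)
print(prompt)
# Moody night street scene, light rain, drifting mist, neon reflections, camera slow orbit around the subject.
```

Keeping the cue groups separate makes it easy to vary only the camera move or only the environment across a batch of clips.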
Negative prompt examples
- blur, distortion, low quality
- jitter, warping, melted details, extra limbs
- watermark, logo, subtitles, text artifacts
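The groups above can be combined into a single negative_prompt. A minimal sketch, assuming the field takes one comma-separated string as in the examples above (`merge_negative_prompts` is a hypothetical helper):

```python
def merge_negative_prompts(*groups):
    """Merge comma-separated term lists, dropping case-insensitive duplicates."""
    seen = set()
    terms = []
    for group in groups:
        for term in group.split(","):
            term = term.strip()
            key = term.lower()
            if term and key not in seen:
                seen.add(key)
                terms.append(term)
    return ", ".join(terms)


negative_prompt = merge_negative_prompts(
    "blur, low quality",
    "jitter, warping, Blur",
    "watermark, logo",
)
print(negative_prompt)  # blur, low quality, jitter, warping, watermark, logo
```

The first spelling of each term is kept, and the original order is preserved.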