Kwaivgi Kling V1.6 Multi I2V Standard
Kling v1.6 Image-to-Video delivers 195% better results than Kling 1.5, with improved prompt understanding, physics, and visual effects. It is available through a ready-to-use REST inference API: best performance, no cold starts, affordable pricing.
Features
Kling V1.6 Multi Image-to-Video Standard
Transform multiple reference images into cinematic videos with Kling V1.6. This powerful image-to-video model accepts up to 4 reference images, enabling more complex scene composition, character consistency, and creative control — perfect for fashion editorials, product showcases, and storytelling content.
Why It Looks Great
- Multi-image input: Use up to 4 reference images for richer scene composition and better consistency.
- Cinematic motion: Creates smooth, professional-quality movement with dramatic flair.
- Negative prompt support: Exclude unwanted elements for precise control over the output.
- Flexible duration: Generate 5-second or 10-second videos.
- Multiple aspect ratios: Square (1:1), landscape (16:9), or portrait (9:16) formats.
- Prompt Enhancer: Built-in tool to refine and expand your motion descriptions.
- Safety Checker: Optional content filtering for appropriate output.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the motion, scene, and cinematography you want. |
| negative_prompt | No | Elements to avoid in the generated video. |
| images | Yes | Up to 4 reference images to guide generation (upload or URLs). |
| duration | No | Video length: 5 or 10 seconds. Default: 5. |
| aspect_ratio | No | Output format: 1:1, 16:9, or 9:16. Default: 1:1. |
| Enable Safety Checker | No | Toggle content safety filtering. |
How to Use
- Write your prompt — describe the motion, camera work, and atmosphere in detail.
- Use Prompt Enhancer (optional) — click to automatically enrich your description.
- Add negative prompt (optional) — specify elements to exclude.
- Upload reference images — add up to 4 images using the “+ Add Item” button.
- Set duration — choose 5 or 10 seconds.
- Select aspect ratio — pick the format that fits your use case.
- Run — click the button to generate.
- Download — preview and save your video.
Pricing
Billing is per 5-second unit of video duration, at $0.25 per unit.
| Duration | Calculation | Cost |
|---|---|---|
| 5 seconds | 5 ÷ 5 × $0.25 | $0.25 |
| 10 seconds | 10 ÷ 5 × $0.25 | $0.50 |
Volume Examples
| Videos | Duration | Total Cost |
|---|---|---|
| 5 | 5s | $1.25 |
| 5 | 10s | $2.50 |
| 10 | 5s | $2.50 |
| 10 | 10s | $5.00 |
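The arithmetic in the two pricing tables can be reproduced with a short sketch. The function names `video_cost` and `batch_cost` are hypothetical helpers for illustration, not part of any WaveSpeedAI SDK; the only figures used are the $0.25 rate and the 5-second unit from the tables above.

```python
from decimal import Decimal

# Values taken from the pricing table: $0.25 per 5-second unit.
PRICE_PER_UNIT = Decimal("0.25")
UNIT_SECONDS = 5


def video_cost(duration_seconds: int) -> Decimal:
    """Cost in USD of one video of the given duration."""
    if duration_seconds not in (5, 10):
        raise ValueError("duration must be 5 or 10 seconds")
    return PRICE_PER_UNIT * (duration_seconds // UNIT_SECONDS)


def batch_cost(video_count: int, duration_seconds: int) -> Decimal:
    """Cost in USD of several videos that share the same duration."""
    return video_cost(duration_seconds) * video_count


# Reproduce the rows of the volume table.
for count, duration in [(5, 5), (5, 10), (10, 5), (10, 10)]:
    print(f"{count} videos x {duration}s = ${batch_cost(count, duration):.2f}")
```

`Decimal` is used instead of `float` so that currency totals stay exact.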
Best Use Cases
- Fashion Editorials — Create runway-style videos with dramatic fabric movement and cinematic lighting.
- Product Showcases — Animate product images with dynamic camera movements.
- Character Animation — Use multiple reference images for consistent character appearance.
- Storytelling — Combine multiple scene elements into cohesive video narratives.
- Social Media Content — Generate eye-catching videos optimized for different platforms.
Example Prompts
- “Model in a flowing silk gown standing on top of a sand dune, golden sunset light illuminating the fabric, camera circling to capture the dress fluttering dramatically in the wind, high-contrast editorial style”
- “Slow camera push in, hair moving gently in the breeze, warm golden hour lighting”
- “Product rotating smoothly on display, studio lighting creating elegant reflections”
- “Dramatic camera orbit around the subject, fabric flowing in slow motion”
- “Cinematic dolly shot revealing the full scene, atmospheric dust particles in the light”
Multi-Image Tips
| Number of Images | Best Use Case |
|---|---|
| 1 image | Simple animation, single subject |
| 2 images | Before/after, transition effects, dual perspectives |
| 3-4 images | Complex scenes, character consistency, multi-element composition |
Pro Tips for Best Results
- Use multiple images when you need consistency across different elements (character + background + props).
- Include cinematic language: “camera circling”, “dolly shot”, “slow motion”, “dramatic lighting”.
- Describe both subject motion and camera movement for more dynamic results.
- Use negative prompts to avoid “blur”, “distortion”, “static”, “jitter”.
- Match aspect ratio to your platform: 1:1 for Instagram, 16:9 for YouTube, 9:16 for TikTok/Reels.
- Start with 5-second videos to test your concept before generating 10-second versions.
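Several of these tips can be folded into a small helper that drafts settings for a first test run. This is only a sketch: `draft_settings` and the two constants are hypothetical names, and joining the negative terms with commas is an assumption, since the page does not specify a format for `negative_prompt`.

```python
# Platform-to-ratio mapping taken from the tip above.
PLATFORM_ASPECT_RATIOS = {
    "instagram": "1:1",
    "youtube": "16:9",
    "tiktok": "9:16",
    "reels": "9:16",
}

# Negative terms suggested in the tips; the comma-separated format is an assumption.
DEFAULT_NEGATIVE_TERMS = ["blur", "distortion", "static", "jitter"]


def draft_settings(subject_motion: str, camera_movement: str, platform: str) -> dict:
    """Combine camera and subject motion into one prompt and pick platform defaults."""
    return {
        "prompt": f"{camera_movement}, {subject_motion}",
        "negative_prompt": ", ".join(DEFAULT_NEGATIVE_TERMS),
        "aspect_ratio": PLATFORM_ASPECT_RATIOS[platform.lower()],
        "duration": 5,  # start with a 5-second test before a 10-second render
    }


settings = draft_settings(
    subject_motion="fabric flowing in slow motion",
    camera_movement="Dramatic camera orbit around the subject",
    platform="TikTok",
)
print(settings)
```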
Notes
- If using URLs for images, ensure they are publicly accessible. Preview thumbnails confirm successful loading.
- Processing time scales with duration — 10-second videos take longer.
- Enable Safety Checker for content that will be publicly shared.
- Reference images influence style and composition — choose high-quality sources.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
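# Submit the task (prompt and images are required; the prompt and image URL below are placeholders)
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1.6-multi-i2v-standard" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"prompt": "Slow camera push in, hair moving gently in the breeze, warm golden hour lighting",
"images": ["https://example.com/reference-1.jpg"],
"duration": 5,
"aspect_ratio": "1:1"
}'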
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
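For readers who prefer Python to curl, the submit call could be sketched with the standard library alone. The URL and headers mirror the curl command above; `build_submit_request` and `submit_task` are hypothetical helper names, not an official client, and the 30-second timeout is an arbitrary choice.

```python
import json
import urllib.request

# Endpoint copied from the curl example above.
SUBMIT_URL = "https://api.wavespeed.ai/api/v3/kwaivgi/kling-v1.6-multi-i2v-standard"


def build_submit_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request for a new task."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def submit_task(api_key: str, payload: dict) -> dict:
    """Send the request and return the decoded JSON body (performs network I/O)."""
    request = build_submit_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes it possible to inspect the method, headers, and body without touching the network. A real call would pass the key from the `WAVESPEED_API_KEY` environment variable to `submit_task`.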
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | max length 2500 | The positive prompt for the generation. |
| negative_prompt | string | No | - | - | The negative prompt for the generation. |
| images | array | Yes | [] | 1 ~ 4 items | A list of images to use as style references. |
| duration | integer | No | 5 | 5, 10 | The duration of the generated media in seconds. |
| aspect_ratio | string | No | 1:1 | 1:1, 16:9, 9:16 | The aspect ratio of the generated media. |
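The constraints in this table can be checked on the client before a request is sent, which avoids paying a round trip for an obviously invalid payload. The sketch below is one way to do that; `build_payload` is a hypothetical helper, and treating the 2500 limit as a character count is an assumption, since the table does not state the unit.

```python
ALLOWED_DURATIONS = (5, 10)
ALLOWED_ASPECT_RATIOS = ("1:1", "16:9", "9:16")
MAX_PROMPT_LENGTH = 2500  # assumed to be measured in characters
MAX_IMAGES = 4


def build_payload(prompt, images, negative_prompt=None, duration=5, aspect_ratio="1:1"):
    """Validate the arguments against the request table and return the JSON payload."""
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_LENGTH:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_LENGTH} characters")
    if not 1 <= len(images) <= MAX_IMAGES:
        raise ValueError(f"images must contain 1 to {MAX_IMAGES} items")
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")

    payload = {
        "prompt": prompt,
        "images": list(images),
        "duration": duration,
        "aspect_ratio": aspect_ratio,
    }
    # negative_prompt is optional, so it is only included when provided.
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload
```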
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction (the task ID) |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
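A minimal sketch of reading these fields from a submission response follows. `SAMPLE_RESPONSE` is a hand-written illustration assembled from the field table, not a captured API response, and its ID and model values are placeholders. `SubmittedTask` and `parse_submission` are hypothetical names.

```python
from dataclasses import dataclass

# Illustrative body built from the field table above; all values are placeholders.
SAMPLE_RESPONSE = {
    "code": 200,
    "message": "success",
    "data": {
        "id": "example-task-id",
        "model": "kwaivgi/kling-v1.6-multi-i2v-standard",
        "outputs": [],
        "urls": {"get": "https://example.com/predictions/example-task-id/result"},
        "status": "created",
        "error": "",
    },
}


@dataclass
class SubmittedTask:
    task_id: str
    status: str
    result_url: str


def parse_submission(body: dict) -> SubmittedTask:
    """Extract the task ID, status, and result URL from a submission response."""
    if body.get("code") != 200:
        raise RuntimeError(f"submission failed: {body.get('message')}")
    data = body["data"]
    return SubmittedTask(
        task_id=data["id"],
        status=data["status"],
        result_url=data["urls"]["get"],
    )


print(parse_submission(SAMPLE_RESPONSE))
```

Reading the result URL from `data.urls.get` avoids rebuilding it by hand from the task ID.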
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier of the prediction being retrieved |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
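Because `data.status` moves through created and processing before reaching completed or failed, a client typically polls the result endpoint until a terminal status appears. The sketch below shows one such loop. `wait_for_outputs` is a hypothetical helper, the polling interval and attempt limit are arbitrary assumptions, and `fetch_result` stands for any callable that returns the decoded result response (for example, a GET on `data.urls.get` with the Authorization header).

```python
import time
from typing import Callable


def wait_for_outputs(
    fetch_result: Callable[[], dict],
    poll_interval: float = 2.0,
    max_attempts: int = 150,
    sleep: Callable[[float], None] = time.sleep,
) -> list:
    """Poll until the task completes and return data.outputs.

    Raises RuntimeError if the task fails and TimeoutError if it never
    reaches a terminal status within max_attempts polls.
    """
    for _ in range(max_attempts):
        data = fetch_result()["data"]
        status = data["status"]
        if status == "completed":
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error") or "task failed")
        # Status is still created or processing: wait, then poll again.
        sleep(poll_interval)
    raise TimeoutError(f"task not finished after {max_attempts} polls")
```

Passing `fetch_result` and `sleep` as parameters keeps the loop independent of any HTTP library and lets it be exercised with canned responses.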