Kwaivgi Kling Image O1
Playground
Try it on WavespeedAI!Kling Omni Image O1 is Kuaishou’s multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
Features
Kling Omni Image O1
Kling Omni Image O1 is Kuaishou’s advanced multi-modal image generation model, featuring MVL (Multi-modal Visual Language) technology that combines natural language with image references for unprecedented creative control.
🌟 Four Key Advantages
1. Feature Consistency
Maintains subject characteristics across multiple images:
- Preserved outlines and core elements
- Consistent color tones and lighting
- Unified style across series
2. Precise Detail Modifications
Edit images without professional skills:
- Add new elements naturally
- Remove unwanted objects cleanly
- Modify specific details precisely
- Maintain original style and texture
3. Style Control
Apply and maintain artistic styles:
- Consistent visual language
- Brand-aligned aesthetics
- Cross-image style coherence
4. Rich Imagination
Generate creative variations while preserving identity:
- New poses and scenarios
- Environmental changes
- Creative interpretations
🎯 Use Cases
- IP Character Design — Create consistent character series
- Comic Panel Creation — Maintain character identity across panels
- Brand Merchandise — Unified styling for product lines
- Image Editing — Professional modifications without skills
- Series Content — Cohesive visual storytelling
🎬 Core Features
- Multi-Reference Support — Up to 10 reference images simultaneously
- Feature Extraction — Intelligent understanding of subject characteristics
- Cross-Image Consistency — Stable identity across generations
- Natural Language Control — Guide creation with text prompts
🚀 How to Use
-
Upload Reference Images Provide 1-10 reference images of your subject.
-
Describe Your Intent Write a prompt for the desired output.
Example: “The character in a winter coat, standing in a snowy forest, same art style”
-
Set Parameters Choose resolution and output format.
-
Generate Receive images with consistent subject features.
💡 Pro Tips
- Use multiple angles of the same subject for better feature extraction
- Provide clear, high-resolution reference images
- Specify style elements you want to maintain
- For character series, include various expressions and poses in references
Price
- $0.028 per run
📝 Example Workflows
| Workflow | Description |
|---|---|
| Character Series | Create consistent characters across different scenes |
| Product Variations | Generate product images with unified branding |
| Comic Creation | Maintain character identity across story panels |
| Style Transfer | Apply consistent artistic style to new subjects |
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"aspect_ratio": "1:1",
"resolution": "1k",
"num_images": 1
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| images | array | No | [] | - | Including reference images of the element, scene, style, etc.max 10 |
| aspect_ratio | string | No | 1:1 | 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, 2:3, 21:9, auto | The aspect ratio of the generated image. |
| resolution | string | No | 1k | 1k, 2k | Image generation resolution |
| num_images | integer | No | 1 | 1 ~ 9 | The number of images to generate. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |