Kling Omni Image O1 is Kuaishou's multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
Idle

$0.028per run·~35 / $1

Change her hair color to light red.

Cinematic post-apocalyptic landscape. Wide shot of the ruins of Times Square, New York, completely overgrown with massive tropical jungle plants and vines. A waterfall is cascading down a crumbled skyscraper. Sunset lighting, volumetric fog, birds flying in the distance, matte painting style, 8k, hyper-detailed.

A cozy modern coffee shop interior with warm ambient lighting and wood textures. On the table, place a clear glass iced latte topped with whipped cream and caramel drizzle. The drink should look naturally photographed within the environment, with realistic reflections and subtle depth of field.

A clean modern kitchen with marble countertops and soft morning sunlight. On the countertop, place a freshly baked croissant on a white ceramic plate. The croissant should look crisp and buttery, integrated naturally with the environment with accurate lighting and shadows.

Add a Nike cap to the character.
Kling Omni Image O1 is Kuaishou's advanced multi-modal image generation model, featuring MVL (Multi-modal Visual Language) technology that combines natural language with image references for unprecedented creative control.
Maintains subject characteristics across multiple images:
Edit images without professional skills:
Apply and maintain artistic styles:
Generate creative variations while preserving identity:
Upload Reference Images Provide 1-10 reference images of your subject.
Describe Your Intent Write a prompt for the desired output.
Example: "The character in a winter coat, standing in a snowy forest, same art style"
Set Parameters Choose resolution and output format.
Generate Receive images with consistent subject features.
| Workflow | Description |
|---|---|
| Character Series | Create consistent characters across different scenes |
| Product Variations | Generate product images with unified branding |
| Comic Creation | Maintain character identity across story panels |
| Style Transfer | Apply consistent artistic style to new subjects |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1 with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Kling Image O1 below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "1:1",
"resolution": "1k",
"num_images": 1
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("kwaivgi/kling-image-o1", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "1:1",
"resolution": "1k",
"num_images": 1
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"kwaivgi/kling-image-o1",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"aspect_ratio": "1:1",
"resolution": "1k",
"num_images": 1
}
)
print(output["outputs"][0]) # → URL of the generated outputKling Image O1 is a Kuaishou model for image editing, exposed as a REST API on WaveSpeedAI. Kling Omni Image O1 is Kuaishou's multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-image-o1.
Kling Image O1 starts at $0.028 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `images`, `aspect_ratio`, `resolution`, `num_images`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-image-o1.
Average end-to-end generation time on WaveSpeedAI is around 58 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.