Vidu Q3 और Q3 Pro मॉडल पर 50% छूट · केवल WaveSpeedAI | 20 मई – 2 जून

Kling Image O1

kwaivgi /

Kling Omni Image O1 is Kuaishou's multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

image-to-image
Input

Drag & drop करें या upload के लिए click करें

preview

Idle

Change her hair color to light red.

$0.028per run·~35 / $1

Next:

ExamplesView all

Change her hair color to light red.

Change her hair color to light red.

Cinematic post-apocalyptic landscape. Wide shot of the ruins of Times Square, New York, completely overgrown with massive tropical jungle plants and vines. A waterfall is cascading down a crumbled skyscraper. Sunset lighting, volumetric fog, birds flying in the distance, matte painting style, 8k, hyper-detailed.

Cinematic post-apocalyptic landscape. Wide shot of the ruins of Times Square, New York, completely overgrown with massive tropical jungle plants and vines. A waterfall is cascading down a crumbled skyscraper. Sunset lighting, volumetric fog, birds flying in the distance, matte painting style, 8k, hyper-detailed.

A cozy modern coffee shop interior with warm ambient lighting and wood textures. On the table, place a clear glass iced latte topped with whipped cream and caramel drizzle. The drink should look naturally photographed within the environment, with realistic reflections and subtle depth of field.

A cozy modern coffee shop interior with warm ambient lighting and wood textures. On the table, place a clear glass iced latte topped with whipped cream and caramel drizzle. The drink should look naturally photographed within the environment, with realistic reflections and subtle depth of field.

A clean modern kitchen with marble countertops and soft morning sunlight. On the countertop, place a freshly baked croissant on a white ceramic plate. The croissant should look crisp and buttery, integrated naturally with the environment with accurate lighting and shadows.

A clean modern kitchen with marble countertops and soft morning sunlight. On the countertop, place a freshly baked croissant on a white ceramic plate. The croissant should look crisp and buttery, integrated naturally with the environment with accurate lighting and shadows.

Add a Nike cap to the character.

Add a Nike cap to the character.

Related Models

README

Kling Omni Image O1

Kling Omni Image O1 is Kuaishou's advanced multi-modal image generation model, featuring MVL (Multi-modal Visual Language) technology that combines natural language with image references for unprecedented creative control.

🌟 Four Key Advantages

1. Feature Consistency

Maintains subject characteristics across multiple images:

  • Preserved outlines and core elements
  • Consistent color tones and lighting
  • Unified style across series

2. Precise Detail Modifications

Edit images without professional skills:

  • Add new elements naturally
  • Remove unwanted objects cleanly
  • Modify specific details precisely
  • Maintain original style and texture

3. Style Control

Apply and maintain artistic styles:

  • Consistent visual language
  • Brand-aligned aesthetics
  • Cross-image style coherence

4. Rich Imagination

Generate creative variations while preserving identity:

  • New poses and scenarios
  • Environmental changes
  • Creative interpretations

🎯 Use Cases

  • IP Character Design — Create consistent character series
  • Comic Panel Creation — Maintain character identity across panels
  • Brand Merchandise — Unified styling for product lines
  • Image Editing — Professional modifications without skills
  • Series Content — Cohesive visual storytelling

🎬 Core Features

  • Multi-Reference Support — Up to 10 reference images simultaneously
  • Feature Extraction — Intelligent understanding of subject characteristics
  • Cross-Image Consistency — Stable identity across generations
  • Natural Language Control — Guide creation with text prompts

🚀 How to Use

  1. Upload Reference Images Provide 1-10 reference images of your subject.

  2. Describe Your Intent Write a prompt for the desired output.

Example: "The character in a winter coat, standing in a snowy forest, same art style"

  1. Set Parameters Choose resolution and output format.

  2. Generate Receive images with consistent subject features.

💡 Pro Tips

  • Use multiple angles of the same subject for better feature extraction
  • Provide clear, high-resolution reference images
  • Specify style elements you want to maintain
  • For character series, include various expressions and poses in references

Price

  • $0.028 per run

📝 Example Workflows

WorkflowDescription
Character SeriesCreate consistent characters across different scenes
Product VariationsGenerate product images with unified branding
Comic CreationMaintain character identity across story panels
Style TransferApply consistent artistic style to new subjects
Accessibility:This website uses AI models provided by third parties.

Kling Image O1 API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1 with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Kling Image O1 below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "1:1",
    "resolution": "1k",
    "num_images": 1
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("kwaivgi/kling-image-o1", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "aspect_ratio": "1:1",
        "resolution": "1k",
        "num_images": 1
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "kwaivgi/kling-image-o1",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "aspect_ratio": "1:1",
    "resolution": "1k",
    "num_images": 1
}
)

print(output["outputs"][0])  # → URL of the generated output

Kling Image O1 API — Frequently asked questions

What is the Kling Image O1 API?

Kling Image O1 is a Kuaishou model for image editing, exposed as a REST API on WaveSpeedAI. Kling Omni Image O1 is Kuaishou's multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Kling Image O1 API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-image-o1.

How much does Kling Image O1 cost per run?

Kling Image O1 starts at $0.028 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Kling Image O1 accept?

Key inputs: `prompt`, `images`, `aspect_ratio`, `resolution`, `num_images`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/kwaivgi/kwaivgi-kling-image-o1.

How long does Kling Image O1 take to generate?

Average end-to-end generation time on WaveSpeedAI is around 58 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Kling Image O1 outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (Kuaishou). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.