WaveSpeedAI APIKwaivgiKwaivgi Kling Image O1

Kwaivgi Kling Image O1

Kwaivgi Kling Image O1

Playground

Try it on WavespeedAI!

Kling Omni Image O1 is Kuaishou’s multi-modal image generation model with MVL technology. Supports up to 10 reference images for feature consistency, precise detail editing (add/remove/modify), style control, and series content creation. Perfect for IP character design, comic panels, and brand merchandise. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Features

Kling Omni Image O1

Kling Omni Image O1 is Kuaishou’s advanced multi-modal image generation model, featuring MVL (Multi-modal Visual Language) technology that combines natural language with image references for unprecedented creative control.


🌟 Four Key Advantages

1. Feature Consistency

Maintains subject characteristics across multiple images:

  • Preserved outlines and core elements
  • Consistent color tones and lighting
  • Unified style across series

2. Precise Detail Modifications

Edit images without professional skills:

  • Add new elements naturally
  • Remove unwanted objects cleanly
  • Modify specific details precisely
  • Maintain original style and texture

3. Style Control

Apply and maintain artistic styles:

  • Consistent visual language
  • Brand-aligned aesthetics
  • Cross-image style coherence

4. Rich Imagination

Generate creative variations while preserving identity:

  • New poses and scenarios
  • Environmental changes
  • Creative interpretations

🎯 Use Cases

  • IP Character Design — Create consistent character series
  • Comic Panel Creation — Maintain character identity across panels
  • Brand Merchandise — Unified styling for product lines
  • Image Editing — Professional modifications without skills
  • Series Content — Cohesive visual storytelling

🎬 Core Features

  • Multi-Reference Support — Up to 10 reference images simultaneously
  • Feature Extraction — Intelligent understanding of subject characteristics
  • Cross-Image Consistency — Stable identity across generations
  • Natural Language Control — Guide creation with text prompts

🚀 How to Use

  1. Upload Reference Images Provide 1-10 reference images of your subject.

  2. Describe Your Intent Write a prompt for the desired output.

    Example: “The character in a winter coat, standing in a snowy forest, same art style”

  3. Set Parameters Choose resolution and output format.

  4. Generate Receive images with consistent subject features.


💡 Pro Tips

  • Use multiple angles of the same subject for better feature extraction
  • Provide clear, high-resolution reference images
  • Specify style elements you want to maintain
  • For character series, include various expressions and poses in references

Price

  • $0.028 per run

📝 Example Workflows

WorkflowDescription
Character SeriesCreate consistent characters across different scenes
Product VariationsGenerate product images with unified branding
Comic CreationMaintain character identity across story panels
Style TransferApply consistent artistic style to new subjects

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-image-o1" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "1:1",
    "resolution": "1k",
    "num_images": 1
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

ParameterTypeRequiredDefaultRangeDescription
promptstringYes-The positive prompt for the generation.
imagesarrayNo[]-Including reference images of the element, scene, style, etc.max 10
aspect_ratiostringNo1:116:9, 9:16, 1:1, 4:3, 3:4, 3:2, 2:3, 21:9, autoThe aspect ratio of the generated image.
resolutionstringNo1k1k, 2kImage generation resolution
num_imagesintegerNo11 ~ 9The number of images to generate.

Response Parameters

ParameterTypeDescription
codeintegerHTTP status code (e.g., 200 for success)
messagestringStatus message (e.g., “success”)
data.idstringUnique identifier for the prediction, Task Id
data.modelstringModel ID used for the prediction
data.outputsarrayArray of URLs to the generated content (empty when status is not completed)
data.urlsobjectObject containing related API endpoints
data.urls.getstringURL to retrieve the prediction result
data.has_nsfw_contentsarrayArray of boolean values indicating NSFW detection for each output
data.statusstringStatus of the task: created, processing, completed, or failed
data.created_atstringISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.errorstringError message (empty if no error occurred)
data.timingsobjectObject containing timing details
data.timings.inferenceintegerInference time in milliseconds

Result Request Parameters

© 2025 WaveSpeedAI. All rights reserved.