Kling O1 Series Officially Launches on WaveSpeedAI — A New Standard for Unified Image & Video Creation

WaveSpeedAI

Introducing the Kling O1 Series

WaveSpeedAI is excited to officially launch the Kling O1 Series, a next-generation multimodal creation family built on the concept of Multi-modal Visual Language (MVL). The series consists of two powerful models:

  • Kling Image O1 — an all-scene image creation and editing model
  • Kling Video O1 — the world’s first unified multimodal video model

Together, they give creators a complete visual production engine that handles text, image, subject, and video inputs with exceptional consistency and creative flexibility.


Kling Image O1 — Advanced Image Creation Across All Scenarios

Designed to remove friction from the entire image creation pipeline, Image O1 combines text-to-image, multi-reference fusion, fine-grained editing, and high-fidelity style transfer into one seamless workflow.

Its core highlights are outlined below.

High Feature Consistency Across Up to 10 References

Image O1 can extract and maintain stable characteristics across as many as 10 reference images, preserving:

  • Identity
  • Object structure
  • Color tone
  • Visual silhouette
  • Global style direction

This is especially powerful for IP character design, comic frame consistency, brand visual systems, and series-based conceptual art.
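For teams scripting this workflow, the 10-reference ceiling can be checked before a request is sent. The sketch below is a minimal illustration only: the `ImageRequest` class, its field names, and the payload keys are hypothetical and are not taken from the WaveSpeedAI API. The only fact drawn from this announcement is the limit of 10 reference images.

```python
from dataclasses import dataclass, field

# Limit stated in the announcement: up to 10 reference images.
MAX_REFERENCE_IMAGES = 10


@dataclass
class ImageRequest:
    """Hypothetical request shape; names are illustrative, not the real API."""

    prompt: str
    reference_images: list[str] = field(default_factory=list)

    def to_payload(self) -> dict:
        """Validate the request locally and return a plain dict."""
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            raise ValueError(
                f"got {len(self.reference_images)} reference images; "
                f"at most {MAX_REFERENCE_IMAGES} are supported"
            )
        # Copy the list so later changes to the request do not alter the payload.
        return {"prompt": self.prompt, "images": list(self.reference_images)}
```

Validating on the client side like this surfaces an oversized reference set before any upload happens; the actual request format should be taken from the platform's own documentation.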


Precision Editing via Natural Language

Without needing masks or manual retouching, Image O1 can modify:

  • Objects
  • Characters
  • Colors
  • Backgrounds
  • Materials

…all while maintaining original lighting, shadows, and texture integrity.

Prompt: Change the material of the table to stone.

Original image

Edited image with stone table


Faithful Style Interpretation & Transfer

From felt textures to 3D figurine looks to niche art styles, Image O1 can deeply analyze:

  • Brushstroke patterns
  • Palette structure
  • Composition logic

…delivering natural and coherent style transformations.

Prompt: Convert the picture into a Lego style.

Original image

Lego style transfer


Rich Imagination & Multi-Reference Fusion

Image O1 supports hybrid creation flows such as:

  • Sketch + text
  • Reference + style change
  • Multi-subject fusion
  • Layout reinterpretation

It blends sources naturally without producing the usual “cut-and-paste” look.

Prompt: Change the picture to an overhead view.

Original image

Overhead view generated


Kling Video O1 — The World’s First Unified Multimodal Video Model

Kling Video O1 brings a breakthrough approach to video creation by merging multiple tasks inside one unified model—no mode switching, no fragmented editing steps.

All-in-One Creative Engine

Video O1 unifies tasks that previously required separate tools:

  • Text-to-video
  • Reference-based video generation
  • First/last-frame video creation
  • Video editing, enhancement, and element removal
  • Style rewriting
  • Shot extension

Creators can now move from idea → generation → editing within one continuous experience.


Multi-Modal Input, Multi-Modal Command

Video O1 treats all inputs as one instruction system:

  • Images
  • Video clips
  • Subjects (multi-angle reference)
  • Natural language

You can simply say:

  • “Remove the passerby,”
  • “Change daytime to dusk,”
  • “Replace the outfit,”

…and the model performs pixel-level semantic reconstruction automatically.


Industrial-Level Visual Consistency

Video O1 has a stronger understanding of:

  • Identity
  • Motion
  • Scene logic
  • Multi-subject interactions

Whether a video involves a single speaker or a complex group shot, each character remains stable across frames, angles, and scene changes.


Creative Skill Combinations

Video O1 supports complex hybrid workflows:

  • Add a subject while changing style
  • Use image references while modifying backgrounds
  • Generate next-shot motion based on a video reference

These skills combine inside the same model, with no switching between separate tools.


Narrative Control: 3–10 Second Generation

Creators can set each clip's length between 3 and 10 seconds, which covers short impact moments as well as longer thematic scenes.
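In a scripted pipeline, the 3–10 second window can be enforced before a job is submitted. The following is a rough sketch under stated assumptions: `build_video_request`, the `duration` key, and the use of whole seconds are all hypothetical choices made for illustration. Only the 3 and 10 second bounds come from this announcement.

```python
# Bounds stated in the announcement: clips run from 3 to 10 seconds.
MIN_DURATION_S = 3
MAX_DURATION_S = 10


def build_video_request(prompt: str, duration_s: int) -> dict:
    """Return a hypothetical request dict after checking the clip length.

    Whole-second durations are an assumption made for this sketch.
    """
    if not MIN_DURATION_S <= duration_s <= MAX_DURATION_S:
        raise ValueError(
            f"duration must be between {MIN_DURATION_S} and "
            f"{MAX_DURATION_S} seconds, got {duration_s}"
        )
    return {"prompt": prompt, "duration": duration_s}
```

A 5-second call such as `build_video_request("A paper boat drifting downstream", 5)` passes the check, while 2 or 11 seconds raises a `ValueError` before anything is sent.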


Why Kling O1 Matters

The Kling O1 series represents a new paradigm: one family of models capable of covering every stage of visual creation—image and video alike. Whether you’re a designer, filmmaker, brand team, or solo creator, Kling O1 unlocks new heights of creative efficiency.

It is:

  • Multi-modal
  • Consistent
  • Scalable
  • Creator-friendly
  • Production-ready

FAQ about Kling O1

1. What is the Kling O1 Series? A new Kling suite for advanced image and video creation in your browser.

2. What tools are included? Kling Image O1 and Kling Video O1.

3. Do I need special hardware? No, it works fully online.

4. Can it make both images and videos? Yes—images with Image O1 and videos with Video O1.

5. Is it beginner-friendly? Yes, with simple controls and pro-level features.

6. Can I use it commercially? Yes, depending on your WaveSpeedAI plan.


Conclusion

The Kling O1 Series brings creators a new level of consistency, control, and multimodal intelligence across both images and videos. It makes creative work smoother, more unified, and far more efficient—whether you’re generating, editing, or building full visual stories.

And with WaveSpeedAI, getting started is effortless. No downloads, no setup—just open your browser and create.

👉 Try Kling Image O1 and Kling Video O1 on WaveSpeedAI today.


© 2025 WaveSpeedAI. All rights reserved.