WaveSpeed.ai
/탐색/Kling O1 Models/kwaivgi/kling-video-o1/reference-to-video
image-to-video

image-to-video

Kling Omni Video O1

kwaivgi/kling-video-o1/reference-to-video

Kling Omni Video O1 Reference-to-Video generates creative videos using character, prop, or scene references from multiple viewpoints. Extracts subject features and creates new video content while maintaining identity consistency across frames. Ready-to-use REST API, best performance, no cold starts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

preview
Select whether to keep the video original sound through the parameter

Idle

이 요청에는 $0.56 실행당가 필요합니다.

$10으로 이 모델을 약 17회 실행할 수 있습니다.

추가 안내::

예시전체 보기

README

Kling Omni Video O1 — Reference-to-Video

Kling Omni Video O1 is Kuaishou's groundbreaking unified multi-modal video model. The Reference-to-Video mode creates new video content based on subject references — maintaining character, prop, and scene identity while generating entirely new creative scenarios.

Key Capabilities

Multi-Reference Subject Creation

Build subjects from multiple reference viewpoints:

  • Extract features from character, prop, or scene images
  • Maintain consistent identity in generated videos
  • Create new scenarios with familiar subjects

Subject Consistency Technology

Advanced feature extraction ensures:

  • Stable character appearance across all frames
  • Consistent clothing, accessories, and props
  • Maintained facial features and expressions
  • Coherent scene elements and backgrounds

Creative Freedom

Generate entirely new content while preserving identity:

  • New poses and actions
  • Different scenes and environments
  • Various camera angles and movements
  • Fresh creative scenarios

Core Features

  • Identity Lock — Subject features remain consistent throughout video
  • Multi-Angle Support — Use references from various viewpoints
  • Scene Flexibility — Place subjects in new environments
  • Motion Control — Guide actions with text prompts

How to Use

  1. Upload Reference Images Provide one or more images of your subject (character, object, or scene).

  2. Describe the Scenario Write a prompt for the new video content.

    Example: "The character walking through a futuristic city at night, neon lights reflecting on wet streets"

  3. Set Parameters Choose duration, resolution, and output format.

  4. Generate Receive video with your subject in the new scenario.

Pricing

Reference TypePrice per Second
Image Reference$0.112
Video Reference$0.168

$0.112/s for image reference only; $0.168/s when using video reference.

Pro Tips

  • Use multiple reference angles for better identity capture
  • Provide clear, high-resolution reference images
  • Describe actions and environments clearly in prompts
  • Works best for characters, products, and distinct objects

Note

  • If the input reference parameters include a video, then the number of reference images that can be entered will be reduced to 4.