Kling Omni Video O1 — Reference-to-Video
Kling Omni Video O1 is Kuaishou's groundbreaking unified multi-modal video model. The Reference-to-Video mode creates new video content based on subject references — maintaining character, prop, and scene identity while generating entirely new creative scenarios.
Key Capabilities
Multi-Reference Subject Creation
Build subjects from multiple reference viewpoints:
- Extract features from character, prop, or scene images
- Maintain consistent identity in generated videos
- Create new scenarios with familiar subjects
Subject Consistency Technology
Advanced feature extraction ensures:
- Stable character appearance across all frames
- Consistent clothing, accessories, and props
- Maintained facial features and expressions
- Coherent scene elements and backgrounds
Creative Freedom
Generate entirely new content while preserving identity:
- New poses and actions
- Different scenes and environments
- Various camera angles and movements
- Fresh creative scenarios
Core Features
- Identity Lock — Subject features remain consistent throughout video
- Multi-Angle Support — Use references from various viewpoints
- Scene Flexibility — Place subjects in new environments
- Motion Control — Guide actions with text prompts
How to Use
-
Upload Reference Images
Provide one or more images of your subject (character, object, or scene).
-
Describe the Scenario
Write a prompt for the new video content.
Example: "The character walking through a futuristic city at night, neon lights reflecting on wet streets"
-
Set Parameters
Choose duration, resolution, and output format.
-
Generate
Receive video with your subject in the new scenario.
Pricing
| Reference Type | Price per Second |
|---|
| Image Reference | $0.112 |
| Video Reference | $0.168 |
$0.112/s for image reference only; $0.168/s when using video reference.
Pro Tips
- Use multiple reference angles for better identity capture
- Provide clear, high-resolution reference images
- Describe actions and environments clearly in prompts
- Works best for characters, products, and distinct objects
Note
- If the input reference parameters include a video, then the number of reference images that can be entered will be reduced to 4.