Kwaivgi Kling Video O1 Text To Video

Playground

Kling Omni Video O1 is Kuaishou’s first unified multi-modal video model with MVL (Multi-modal Visual Language) technology. Text-to-Video mode generates cinematic videos from text prompts with subject consistency, natural physics simulation, and precise semantic understanding. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.

Features

Kling Omni Video O1 — Text-to-Video

Kling Omni Video O1 is Kuaishou’s groundbreaking unified multi-modal video model, representing the world’s first AI system that seamlessly integrates text, images, videos, and subject references into a single creative engine. The Text-to-Video mode transforms natural language prompts into stunning, cinematic video content.

🌟 Why Kling Video O1 Stands Out

Universal Creative Engine

Unlike traditional single-task models, Video O1 unifies multiple video generation capabilities:

Text-to-video generation
Image-to-video transformation
Reference-based video creation
Video editing and modification
Shot extension and scene continuation

The model interprets your instructions through a revolutionary MVL system that understands:

Natural language descriptions
Visual context and references
Subject identity and characteristics
Scene dynamics and physics

Subject Consistency

Maintains stable character, prop, and scene features across varying shots — similar to professional directing techniques used in film production.

🎬 Core Features

Cinematic Quality — Film-grade visual output with natural lighting and realistic motion
Physics Simulation — Accurate real-world physics for natural movement and dynamics
Semantic Understanding — Deep comprehension of complex prompts and creative intent
Flexible Outputs — Multiple resolution and duration options

🚀 How to Use

Write Your Prompt Describe the scene, action, camera movement, and mood you want.

Example: “A young woman walking through a neon-lit Tokyo street at night, rain reflecting city lights, cinematic tracking shot”
Set Parameters Choose your preferred duration, resolution, and aspect ratio.
Generate Submit your request and receive high-quality video output.

💰 Pricing

Item	Price
Per Second	$0.112

Billed per second of output video duration.

💡 Pro Tips

Use specific camera terms: “tracking shot,” “close-up,” “aerial view”
Describe lighting conditions: “golden hour,” “neon-lit,” “soft diffused light”
Include motion cues: “slowly walking,” “rapid zoom,” “gentle breeze”
Specify mood and atmosphere for better results

Authentication

For authentication details, please refer to the Authentication Guide.

API Endpoints

Submit Task & Query Result


# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-video-o1/text-to-video" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
    "aspect_ratio": "16:9",
    "duration": 5
}'

# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"

Parameters

Task Submission Parameters

Request Parameters

Parameter	Type	Required	Default	Range	Description
prompt	string	Yes		-	The positive prompt for the generation.
aspect_ratio	string	No	16:9	16:9, 9:16, 1:1	The aspect ratio of the generated video.
duration	integer	No	5	5, 10	The duration of the generated media in seconds.

Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data.id	string	Unique identifier for the prediction, Task Id
data.model	string	Model ID used for the prediction
data.outputs	array	Array of URLs to the generated content (empty when status is not `completed`)
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.has_nsfw_contents	array	Array of boolean values indicating NSFW detection for each output
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Result Request Parameters

Parameter	Type	Required	Default	Description
id	string	Yes	-	Task ID

Result Response Parameters

Parameter	Type	Description
code	integer	HTTP status code (e.g., 200 for success)
message	string	Status message (e.g., “success”)
data	object	The prediction data object containing all details
data.id	string	Unique identifier for the prediction, the ID of the prediction to get
data.model	string	Model ID used for the prediction
data.outputs	string	Array of URLs to the generated content (empty when status is not completed).
data.urls	object	Object containing related API endpoints
data.urls.get	string	URL to retrieve the prediction result
data.status	string	Status of the task: `created`, `processing`, `completed`, or `failed`
data.created_at	string	ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”)
data.error	string	Error message (empty if no error occurred)
data.timings	object	Object containing timing details
data.timings.inference	integer	Inference time in milliseconds

Kwaivgi Kling Video O1 Std Video Edit Kwaivgi Kling Video O1 Video Edit