Kwaivgi Kling Video O3 Pro Video Edit
Playground
Try it on WavespeedAI!Kling Omni Video O3 Video-Edit enables conversational video editing through natural language commands. Remove objects, change backgrounds, modify styles, adjust weather/lighting, and transform scenes with simple text instructions like ‘remove pedestrians’ or ‘change daytime to dusk’. Ready-to-use REST API, best performance, no coldstarts, affordable pricing.
Features
Kling Video O3 Pro Video Edit
Kling Video O3 Pro Video Edit is Kuaishou’s most advanced video editing model, enabling natural-language-driven edits on existing video footage. Upload a video, describe the change you want — swap objects, alter scenes, shift styles — and get high-quality edited results with preserved motion and structure. Supports up to 4 reference images for precise visual guidance and optional original audio retention.
Why Choose This?
-
Prompt-driven editing Describe your edits in plain language — no timeline, no masks, no manual keyframing required.
-
Reference image support Attach up to 4 reference images to guide the target element, scene, or style in the output.
-
Audio preservation Keep the original soundtrack intact with the keep_original_sound option.
-
Scene-level understanding The model recognizes objects, backgrounds, and context within the video to apply accurate, context-aware edits.
-
Motion-consistent output Edits blend naturally across frames with strong temporal coherence — minimal flicker or ghosting.
Parameters
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the desired edit |
| video | Yes | Input video to edit (URL or upload) |
| images | No | Up to 4 reference images for element, scene, or style guidance |
| keep_original_sound | No | Whether to keep the original sound from the video (default: enabled) |
How to Use
- Upload your video — drag-and-drop, file upload, or paste a public URL.
- Write your prompt — describe exactly what should change (e.g., “Change the beer to Cola.”).
- Add reference images (optional) — attach up to 4 images to steer the look of elements or styles.
- Set audio preference — toggle keep_original_sound to preserve or remove original audio.
- Run — submit and download the edited video.
Pricing
| Billed Duration | Cost |
|---|---|
| 3 s (minimum) | $1.08 |
| 5 s | $1.80 |
| 8 s | $2.88 |
| 10 s (maximum) | $3.60 |
Billing Rules
- Base rate: $1.80 per 5 seconds
- Minimum charge: 3 seconds
- Maximum billable duration: 10 seconds
Best Use Cases
- Social Media Campaigns — Quickly swap products, backgrounds, or props in short-form videos.
- Brand & Marketing — Replace or update branded elements across video assets without reshooting.
- E-commerce — Edit product videos to showcase different variants, colors, or settings from a single source clip.
- Creative Exploration — Experiment with style changes, scene swaps, and visual concepts on existing footage.
- Storytelling & Film — Adjust scene details, atmosphere, or objects to refine narrative visuals in post-production.
Pro Tips
- Use clear, specific prompts describing exactly what should change for best results.
- Reference images work best when they clearly represent the target element or style.
- Keep keep_original_sound enabled when audio continuity matters for your project.
- Test edits on shorter clips first, then apply to longer footage once satisfied.
- Ensure video URLs are publicly accessible — a preview thumbnail in the interface confirms the link works.
Notes
- Both prompt and video are required fields.
- Billed duration is clamped between 3 and 10 seconds regardless of actual video length.
- If using a URL, make sure it is publicly accessible.
Related Models
- Kling Video O3 Pro Text-to-Video — Generate videos from text prompts with O3 Pro’s highest quality output.
- Kling Video O3 Std Image-to-Video — Animate still images into video with the cost-efficient O3 Standard model.
- Kling Video O3 Pro Reference-to-Video — Create videos guided by reference images for consistent character and style control.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/kwaivgi/kling-video-o3-pro/video-edit" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"keep_original_sound": true
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| video | string | Yes | - | The video URL. Video duration can not be longer than 10s. | |
| images | array | No | [] | - | Including reference images of the element, scene, style, etc. Max 4 |
| keep_original_sound | boolean | No | true | - | Whether to keep the original sound from the video. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |
Result Request Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| id | string | Yes | - | Task ID |
Result Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data | object | The prediction data object containing all details |
| data.id | string | Unique identifier for the prediction, the ID of the prediction to get |
| data.model | string | Model ID used for the prediction |
| data.outputs | string | Array of URLs to the generated content (empty when status is not completed). |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |