WaveSpeed.ai

Kling Omni Video O3 Image-To-Video

kwaivgi/kling-video-o3-pro/image-to-video

Kling Omni Video O3 Image-to-Video transforms static images into dynamic cinematic videos using MVL (Multi-modal Visual Language) technology. It maintains subject consistency while adding natural motion, physics simulation, and seamless scene dynamics, and it supports audio generation. Ready-to-use REST API with best performance, no cold starts, and affordable pricing.


README

Kling Video O3 Pro Image-to-Video

Kling Video O3 Pro is Kuaishou's most powerful image-to-video model, delivering top-tier visual quality and cinematic motion. Upload a reference image and describe the scene — the model generates premium video with optional synchronized sound and start-to-end frame guidance. Supports flexible duration from 3 to 15 seconds.

Why Choose This?

  • O3 Pro quality: The highest visual fidelity and motion realism in the Kling family.

  • Flexible duration: Generate videos of any length from 3 to 15 seconds.

  • Start-end frame guidance: Optional end image for controlled transitions between two frames.

  • Sound generation: Optional synchronized sound effects generated alongside the video.

  • Prompt Enhancer: Built-in tool to automatically improve your motion descriptions.

Parameters

| Parameter | Required | Description |
|-----------|----------|-------------|
| prompt | Yes | Text description of the desired motion and action |
| image | Yes | Start frame image to animate (URL or upload) |
| end_image | No | End frame image for guided transitions |
| duration | No | Video length: 3-15 seconds (default: 5) |
| sound | No | Generate synchronized sound (default: disabled) |
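
As a rough illustration of the parameter table, the sketch below assembles a request body and checks the documented constraints before anything is sent. The function name `build_payload` is hypothetical, and treating `duration` as a whole number of seconds and `sound` as a boolean are assumptions, since the page does not state the exact types.

```python
from typing import Optional

# Limits and defaults taken from the parameter table above.
MIN_DURATION = 3
MAX_DURATION = 15
DEFAULT_DURATION = 5


def build_payload(
    prompt: str,
    image: str,
    end_image: Optional[str] = None,
    duration: int = DEFAULT_DURATION,  # assumption: whole seconds
    sound: bool = False,               # assumption: boolean flag
) -> dict:
    """Build a request body using the documented parameter names (hypothetical helper)."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if not image:
        raise ValueError("image is required")
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError(
            f"duration must be between {MIN_DURATION} and {MAX_DURATION} seconds"
        )

    payload = {
        "prompt": prompt,
        "image": image,
        "duration": duration,
        "sound": sound,
    }
    # end_image is optional, so include it only when one is provided.
    if end_image:
        payload["end_image"] = end_image
    return payload
```

Calling `build_payload("slow dolly-in on the subject", "https://example.com/start.png")` returns a body with the default 5-second duration and sound disabled; a duration outside 3 to 15 raises `ValueError` before any request is made.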

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt — describe the motion, camera movement, and action.
  3. Upload end image (optional) — provide an end frame for guided transitions.
  4. Set duration — choose any length from 3 to 15 seconds.
  5. Enable sound (optional) — generate synchronized audio with the video.
  6. Run — submit and download your video.
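
The same steps can be approximated programmatically. The sketch below is a minimal illustration using only the Python standard library. The model path comes from this page, but the base URL `https://api.wavespeed.ai/api/v3`, the bearer-token header, and the JSON shape of the response are assumptions that should be checked against the official API documentation. The helpers `build_request` and `submit` are hypothetical names.

```python
import json
import urllib.request

# Assumption: base URL and auth scheme are not stated on this page.
API_BASE = "https://api.wavespeed.ai/api/v3"
# Model path as shown at the top of this page.
MODEL_ID = "kwaivgi/kling-video-o3-pro/image-to-video"


def build_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Prepare (but do not send) a POST request carrying the JSON payload."""
    return urllib.request.Request(
        url=f"{API_BASE}/{MODEL_ID}",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(api_key: str, payload: dict, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON response.

    The response structure is not documented on this page, so it is
    returned as-is for the caller to inspect.
    """
    request = build_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call might look like `submit(api_key, {"prompt": "...", "image": "https://example.com/start.png", "duration": 5, "sound": False})`. How the finished video is retrieved (for example, by polling a result URL) depends on the actual API and is deliberately left out of this sketch.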

Pricing

| Duration | Sound Off | Sound On |
|----------|-----------|----------|
| 3s | $0.72 | $0.90 |
| 5s | $1.20 | $1.50 |
| 10s | $2.40 | $3.00 |
| 15s | $3.60 | $4.50 |

Billing Rules

  • Base rate: $1.20 per 5 seconds ($0.24 per second, scaled linearly with duration)
  • Sound multiplier: disabled = 1×, enabled = 1.25×
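
The billing rules can be worked through in a few lines. This is a sketch that assumes the cost scales linearly per second, which matches the pricing table above but is not stated as an official formula. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Values taken from the billing rules above.
BASE_RATE_PER_5S = Decimal("1.20")
SOUND_MULTIPLIER = Decimal("1.25")


def estimate_cost(duration_s: int, sound: bool = False) -> Decimal:
    """Estimate the price in USD for one run (hypothetical helper).

    Assumes linear per-second pricing derived from the 5-second base rate.
    """
    if not 3 <= duration_s <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    per_second = BASE_RATE_PER_5S / 5  # $0.24 per second
    cost = per_second * duration_s
    if sound:
        cost *= SOUND_MULTIPLIER
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))
```

For example, `estimate_cost(10, sound=True)` gives `Decimal("3.00")`, which agrees with the 10s "Sound On" entry in the pricing table.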

Best Use Cases

  • Premium Production — Cinematic scenes requiring the highest visual quality.
  • Long-Form Animation — Up to 15 seconds for extended scene development.
  • Scene Transitions — Use start and end frames for smooth cinematic transitions.
  • Professional Marketing — Premium promotional videos with sound.
  • Short Films — Film-quality animated scenes from still images.

Pro Tips

  • Use detailed, cinematic prompts for best results (slow-motion, lighting, camera angles).
  • Add an end_image for controlled transitions between two visual states.
  • Enable sound for campfires, rain, city ambience, and other environmental audio.
  • Use shorter durations (3-5s) for testing, longer (10-15s) for final production.
  • Use high-quality source images for the best video output.

Notes

  • Both prompt and image are required fields.
  • Duration supports any value from 3 to 15 seconds.
  • Enabling sound multiplies the cost by 1.25× (a 25% increase).
  • Ensure uploaded image URLs are publicly accessible.
