Vidu Contest
WaveSpeed.ai
Início/Explorar/Kling Models/kwaivgi/kling-v3.0-std/image-to-video
image-to-video

image-to-video

Kling 3.0 Standard

kwaivgi/kling-v3.0-std/image-to-video

Kling 3.0 Standard delivers high-quality image-to-video generation with smooth motion, cinematic visuals, accurate prompt adherence, and native audio for ready-to-share clips. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input

Hint: You can drag and drop a file or click to upload

preview

Hint: You can drag and drop a file or click to upload

Whether sound is generated simultaneously when generating a video.

Idle

Sua solicitação custará $0.18 por execução.

Por $10 você pode executar este modelo aproximadamente 55 vezes.

Mais uma coisa::

ExemplosVer todos

README

Kling V3.0 Standard Image-to-Video

Kling V3.0 Standard Image-to-Video is Kuaishou's latest image-to-video generation model. Upload a reference image and describe the motion — the model generates cinematic video with optional synchronized sound, voice support, and start-to-end frame guidance.

Why Choose This?

  • Latest Kling generation V3.0 delivers improved motion quality and visual fidelity over V2.6.

  • Start-end frame guidance Optional end image for controlled transitions between two frames.

  • Sound generation Optional synchronized sound effects generated alongside the video.

  • Voice list support Add up to 2 custom voice entries for character dialogue.

  • CFG scale control Fine-tune the balance between prompt adherence and creative freedom.

Parameters

ParameterRequiredDescription
promptNoText description of the desired motion and action
negative_promptNoElements to exclude from generation
imageYesStart frame image to animate (URL or upload)
end_imageNoEnd frame image for guided transitions
durationNoVideo length: 5 or 10 seconds (default: 5)
cfg_scaleNoPrompt adherence strength (default: 0.5)
soundNoGenerate synchronized sound (default: disabled)
voice_listNoCustom voice entries, up to 2 (click "+ Add Item")

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt (optional) — describe the motion, camera movement, and action.
  3. Upload end image (optional) — provide an end frame for guided transitions.
  4. Add negative prompt (optional) — specify what you want to avoid.
  5. Set duration — 5 seconds or 10 seconds.
  6. Adjust cfg_scale (optional) — higher for stricter prompt following, lower for more freedom.
  7. Enable sound (optional) — generate synchronized audio with the video.
  8. Add voices (optional) — add up to 2 voice entries for dialogue.
  9. Run — submit and download your video.

Pricing

DurationSound OffSound On
5s$0.18$0.27
10s$0.36$0.54

Billing Rules

  • Sound multiplier: disabled = 1×, enabled = 1.5×

Best Use Cases

  • Photo Animation — Bring portraits, landscapes, and product images to life.
  • Scene Transitions — Use start and end frames for smooth visual transitions.
  • Social Media Content — Create engaging videos with sound from still images.
  • Marketing & Ads — Generate dynamic promotional videos from product photos.
  • Storytelling — Animate scenes with synchronized audio and dialogue.

Pro Tips

  • Use clear, descriptive prompts with specific motion details for best results.
  • Add an end_image for controlled transitions between two visual states.
  • Enable sound for a complete video experience with synchronized audio.
  • Use negative prompts to avoid artifacts (e.g., "blurry, low quality, distorted").
  • Lower cfg_scale for more creative variation, higher for strict prompt adherence.
  • Use high-quality source images for better video results.

Notes

  • Image is the only required field; prompt is optional but recommended.
  • Duration options are 5 or 10 seconds only.
  • Voice list supports a maximum of 2 entries.
  • Sound generation increases cost by 1.5×.
  • Ensure uploaded image URLs are publicly accessible.

Related Models