Alibaba WAN 2.6 Image-To-Video Pro

alibaba/wan-2.6/image-to-video-pro

Alibaba WAN 2.6 Image-to-Video Pro converts images into premium-quality videos with superior motion dynamics, enhanced visual fidelity, and professional cinematic output. It is available through a ready-to-use REST inference API, with no cold starts and affordable pricing.


Wan 2.6 Image-to-Video Pro

Wan 2.6 Image-to-Video Pro is Alibaba's premium image-to-video model, transforming still images into cinematic video with superior motion quality and detail. Upload a reference image and describe the scene and motion; the model then generates smooth, high-resolution video, with optional audio input and flexible duration options.

Why Choose This?

  • Pro-tier quality: Superior visual fidelity and motion realism from Alibaba's latest Wan 2.6 architecture.
  • Multiple resolutions: Output in 1080p, 2K, or 4K to match your production needs.
  • Audio support: Optional audio input for synchronized video generation.
  • Shot type control: Choose between single or multi-shot compositions.
  • Prompt Enhancer: Built-in prompt optimizer for improved generation results.
  • Negative prompt support: Specify elements to exclude for more precise control.

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| prompt | Yes | Text description of the desired scene and motion |
| image | Yes | Reference image to animate (URL or upload) |
| audio | No | Audio file for synchronized video (URL or upload) |
| negative_prompt | No | Elements to exclude from the video |
| resolution | No | Output resolution: 1080p (default), 2k, 4k |
| duration | No | Video length in seconds (default: 5) |
| shot_type | No | Shot composition: single (default) or multi |
| enable_prompt_expansion | No | Enable prompt optimizer (default: disabled) |
| seed | No | Random seed for reproducibility (-1 for random) |
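
The parameter table above can be turned into a small validation helper. The sketch below is illustrative only: `build_payload` is a hypothetical function, and it assumes the request body is a flat JSON object whose keys match the parameter names in the table, which this page does not confirm.

```python
from typing import Any, Dict, Optional

# Allowed values taken from the parameter table above.
RESOLUTIONS = {"1080p", "2k", "4k"}
SHOT_TYPES = {"single", "multi"}


def build_payload(
    prompt: str,
    image: str,
    audio: Optional[str] = None,
    negative_prompt: Optional[str] = None,
    resolution: str = "1080p",
    duration: int = 5,
    shot_type: str = "single",
    enable_prompt_expansion: bool = False,
    seed: int = -1,
) -> Dict[str, Any]:
    """Validate inputs and return a request body (hypothetical helper)."""
    # prompt and image are the only required fields.
    if not prompt or not image:
        raise ValueError("prompt and image are both required")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")
    if shot_type not in SHOT_TYPES:
        raise ValueError(f"shot_type must be one of {sorted(SHOT_TYPES)}")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "image": image,
        "resolution": resolution,
        "duration": duration,
        "shot_type": shot_type,
        "enable_prompt_expansion": enable_prompt_expansion,
        "seed": seed,  # -1 asks for a random seed
    }
    # Optional fields are included only when the caller supplies them.
    if audio is not None:
        payload["audio"] = audio
    if negative_prompt is not None:
        payload["negative_prompt"] = negative_prompt
    return payload
```

Calling `build_payload("a cat stretching", "https://example.com/cat.png")` returns a body that uses the documented defaults and omits `audio` and `negative_prompt`.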

How to Use

  1. Upload your image — provide the reference image to animate.
  2. Write your prompt — describe the scene, motion, camera movement, and mood in detail.
  3. Add audio (optional) — upload audio for synchronized video generation.
  4. Add negative prompt (optional) — specify elements you want to avoid.
  5. Choose resolution — select 1080p, 2K, or 4K based on your needs.
  6. Set duration — choose the desired video length.
  7. Select shot type — single for focused shots, multi for complex compositions.
  8. Enable prompt expansion (optional) — let the optimizer enhance your prompt.
  9. Run — submit and download your video.
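
For programmatic use, the same steps can be sketched as a single JSON POST request. Everything specific here is an assumption: the endpoint URL is a placeholder, the Bearer-token header is a guessed auth scheme, and the field names are simply copied from the Parameters section. The code builds the request but does not send it.

```python
import json
import urllib.request

# Placeholder endpoint: the real URL is not given in this document.
API_URL = "https://api.example.com/alibaba/wan-2.6/image-to-video-pro"


def make_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one generation job."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


request = make_request(
    "YOUR_API_KEY",
    {
        "image": "https://example.com/product.png",  # step 1
        "prompt": "Slow dolly-in on the product, soft studio lighting",  # step 2
        "negative_prompt": "blurry, distorted",  # step 4
        "resolution": "1080p",  # step 5
        "duration": 5,  # step 6
        "shot_type": "single",  # step 7
        "enable_prompt_expansion": True,  # step 8
    },
)
# Step 9 would send it, for example with urllib.request.urlopen(request),
# once a real endpoint and credentials are substituted.
```

Step 3 (audio) is omitted here because it is optional; adding an `audio` key with a publicly accessible URL would follow the same pattern.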

Pricing

| Duration | 1080p | 2k | 4k |
| --- | --- | --- | --- |
| 5 s | $0.60 | $0.70 | $0.80 |
| 10 s | $1.20 | $1.40 | $1.60 |
| 15 s | $1.80 | $2.10 | $2.40 |

Billing Rules

  • Base rate (1080p): $0.60 per 5 seconds
  • 2K rate: $0.70 per 5 seconds
  • 4K rate: $0.80 per 5 seconds
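
The billing rules amount to a per-resolution rate multiplied by the number of 5-second blocks. A minimal sketch of that arithmetic follows, assuming only the 5, 10, and 15 second durations listed in the pricing table are billable; the function names are hypothetical, and amounts are kept in integer cents to avoid floating-point rounding.

```python
# Price per 5-second block in US cents, from the billing rules above.
CENTS_PER_5S = {"1080p": 60, "2k": 70, "4k": 80}

# Durations that appear in the pricing table.
PRICED_DURATIONS = (5, 10, 15)


def estimate_cost_cents(resolution: str, duration_s: int) -> int:
    """Return the cost of one run in cents."""
    if resolution not in CENTS_PER_5S:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if duration_s not in PRICED_DURATIONS:
        raise ValueError(f"duration must be one of {PRICED_DURATIONS}")
    return CENTS_PER_5S[resolution] * (duration_s // 5)


def runs_for_budget(budget_cents: int, resolution: str, duration_s: int) -> int:
    """Return how many whole runs fit within a budget given in cents."""
    return budget_cents // estimate_cost_cents(resolution, duration_s)
```

For instance, `estimate_cost_cents("4k", 15)` gives 240 (that is, $2.40), and `runs_for_budget(1000, "1080p", 5)` gives 16 runs for a $10 budget.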

Best Use Cases

  • Premium Production — High-resolution video requiring superior visual quality.
  • Marketing & Ads — Cinematic promotional videos with professional polish.
  • Music Videos — Synchronized video generation with audio input.
  • E-commerce — Bring product images to life in stunning detail.
  • Content Creation — Create engaging short-form videos for social media.

Pro Tips

  • Use detailed, cinematic prompts for best results — include lighting, camera angles, and motion descriptions.
  • Try the Prompt Enhancer (enable_prompt_expansion) to automatically refine your descriptions.
  • Use negative_prompt to avoid common issues like blurry faces or unwanted elements.
  • Add audio for music videos or content requiring synchronized sound.
  • Use single shot_type for focused character or product animations, multi for complex scene compositions.
  • Set a specific seed for reproducible results across multiple generations.

Notes

  • Both prompt and image are required fields.
  • Ensure uploaded image and audio URLs are publicly accessible.
  • Higher resolutions (2K, 4K) produce better detail but cost more.
