Wan 2.6 Image-to-Video Pro
Wan 2.6 Image-to-Video Pro is Alibaba's premium image-to-video model, transforming still images into cinematic video with superior motion quality and detail. Upload a reference image, describe the scene and motion — the model generates smooth, high-resolution video with optional audio input and flexible duration options.
Why Choose This?
-
Pro-tier quality
Superior visual fidelity and motion realism from Alibaba's latest Wan 2.6 architecture.
-
Multiple resolutions
Output in 1080p, 2K, or 4K to match your production needs.
-
Audio support
Optional audio input for synchronized video generation.
-
Shot type control
Choose between single or multi-shot compositions.
-
Prompt Enhancer
Built-in prompt optimizer for improved generation results.
-
Negative prompt support
Specify elements to exclude for more precise control.
Parameters
| Parameter | Required | Description |
|---|
| prompt | Yes | Text description of the desired scene and motion |
| image | Yes | Reference image to animate (URL or upload) |
| audio | No | Audio file for synchronized video (URL or upload) |
| negative_prompt | No | Elements to exclude from the video |
| resolution | No | Output resolution: 1080p (default), 2k, 4k |
| duration | No | Video length in seconds (default: 5) |
| shot_type | No | Shot composition: single (default) or multi |
| enable_prompt_expansion | No | Enable prompt optimizer (default: disabled) |
| seed | No | Random seed for reproducibility (-1 for random) |
How to Use
- Upload your image — provide the reference image to animate.
- Write your prompt — describe the scene, motion, camera movement, and mood in detail.
- Add audio (optional) — upload audio for synchronized video generation.
- Add negative prompt (optional) — specify elements you want to avoid.
- Choose resolution — select 1080p, 2K, or 4K based on your needs.
- Set duration — choose the desired video length.
- Select shot type — single for focused shots, multi for complex compositions.
- Enable prompt expansion (optional) — let the optimizer enhance your prompt.
- Run — submit and download your video.
Pricing
| Duration | 1080p | 2k | 4k |
|---|
| 5 s | $0.60 | $0.70 | $0.80 |
| 10 s | $1.20 | $1.40 | $1.60 |
| 15 s | $1.80 | $2.10 | $2.40 |
Billing Rules
- Base rate (1080p): $0.60 per 5 seconds
- 2K rate: $0.70 per 5 seconds
- 4K rate: $0.80 per 5 seconds
Best Use Cases
- Premium Production — High-resolution video requiring superior visual quality.
- Marketing & Ads — Cinematic promotional videos with professional polish.
- Music Videos — Synchronized video generation with audio input.
- E-commerce — Bring product images to life in stunning detail.
- Content Creation — Create engaging short-form videos for social media.
Pro Tips
- Use detailed, cinematic prompts for best results — include lighting, camera angles, and motion descriptions.
- Try the Prompt Enhancer (enable_prompt_expansion) to automatically refine your descriptions.
- Use negative_prompt to avoid common issues like blurry faces or unwanted elements.
- Add audio for music videos or content requiring synchronized sound.
- Use single shot_type for focused character or product animations, multi for complex scene compositions.
- Set a specific seed for reproducible results across multiple generations.
Notes
- Both prompt and image are required fields.
- Ensure uploaded image and audio URLs are publicly accessible.
- Higher resolutions (2K, 4K) produce better detail but cost more.
Related Models
- Wan 2.6 Text-to-Video — Generate videos from text prompts.
- Wan 2.6 Image-to-Video — Standard tier image-to-video at lower cost.