Alibaba WAN 2.5 — Video Extend Model
Alibaba WAN 2.5 is an advanced video extension model available on Alibaba Cloud DashScope, designed to extend or complete video clips while maintaining perfect motion continuity, style consistency, and A/V synchronization. It supports custom durations from 3 to 10 seconds and resolutions up to 1080p.
✨ Why it’s special
- Seamless extension: Extends videos naturally — no abrupt cuts or looping artifacts. The motion flow and scene lighting stay coherent frame to frame.
- Flexible duration control: Choose any target length from 3 s – 10 s (3, 4, 5, 6, 7, 8, 9, 10 s all supported).
- One-pass A/V sync: Automatically aligns audio and mouth movements, ensuring natural speech and rhythm without manual editing.
- Multilingual friendly: Supports prompts in Chinese and English for synchronized video generation with accurate lip-sync.
- Custom audio option: You can upload your own audio (music or voice) or let the model generate one for you — plug-and-play.
- Lightweight and affordable: Built for creators and teams who need fast, consistent, and cost-effective video outputs.
🧩 Designed For
- Content creators & storytellers: Extend scenes smoothly to fit a beat, dialogue, or camera motion.
- Marketing teams: Produce multiple ad variations from a single core clip — faster, cheaper, consistent.
- Corporate training teams: Expand short clips into complete learning segments without reshooting.
- Global enterprises: Generate localized, multilingual video extensions for different audiences.
⚙️ How to Use
- Upload your base video clip.
- (Optional) Upload an audio file (
.wav
/ .mp3
, ≤ 15 MB, 3–30 s).
- Enter your prompt or description (supports Chinese / English).
- Select the output duration (3 s – 10 s).
- Choose resolution (480p / 720p / 1080p).
- Submit the job — wait a few moments, preview, and download your extended video.
💡 Audio Handling
- If audio > chosen video length: Only the first segment (3 s, 5 s, 10 s etc.) is used.
- If audio < chosen video length: Remaining video plays silently.
- If no audio: WAN 2.5 can auto-generate one or leave it silent.
💰 Pricing
Resolution | Price per second | Example (3 s) | Example (5 s) |
---|
480p | $0.05 | $0.15 | $0.25 |
720p | $0.10 | $0.30 | $0.50 |
1080p | $0.15 | $0.45 | $0.75 |
🧠 Pro Tips
- Start with 5 s for testing — balance between cost and quality.
- Upload high-quality input videos to maximize motion consistency.
- Keep your prompt concise but descriptive: include mood, motion, and scene intent.
- For multilingual videos, specify the desired spoken language in your prompt.
📎 Note
- If you didn’t upload your image or video locally, please ensure that the media URL is publicly accessible. A successfully accessible file will display a preview in the interface.