Explore/Wan 2.5 Video Models

Wan 2.5 Video Models

alibaba/wan-2.5/text-to-image
alibaba/wan-2.5/text-to-image

alibaba

$0.03

wan-2.5/text-to-image

alibaba/wan-2.5/image-to-video

alibaba

$0.25

wan-2.5/image-to-video

alibaba/wan-2.5/text-to-video

alibaba

$0.25

wan-2.5/text-to-video

alibaba/wan-2.5/image-to-video-fast

alibaba

$0.17

wan-2.5/image-to-video-fast

alibaba/wan-2.5/text-to-video-fast

alibaba

$0.17

wan-2.5/text-to-video-fast

Alibaba WAN 2.5 Model

Alibaba WAN 2.5 is a state-of-the-art text/image-to-video generation model available on Alibaba Cloud's DashScope platform. This powerful model produces high-quality videos in 480p, 720p, or 1080p resolution, complete with synchronized audio, from simple text or image prompts. Compared to Google Veo3, it offers a more affordable solution with faster video generation speeds, making it an excellent choice for creating audio-embedded videos efficiently.

Model list

wan-2.5/text-to-video

wan-2.5/image-to-video

wan-2.5/text-to-video-fast

wan-2.5/image-to-video-fast

wan-2.5/text-to-image


Veo3 VS Wan 2.5 effect compare


What makes Wan 2.5 stand out?

More affordable

Although Google recently announced price cuts, Veo 3 still remains costly overall.

In contrast, Wan 2.5 is leaner and more budget-friendly, offering creators more options while significantly reducing production costs.

One-pass outputs with end-to-end A/V sync

With Wan 2.5, you no longer need to record separate voiceovers or manually align lips for silent AI videos.

Just give a clear, well-structured prompt to generate a complete video with audio/voiceover and lip-sync all at once. The process becomes faster and simpler.

Multilingual friendly

When prompts are in Chinese or Minor languages, Wan 2.5 reliably produces A/V-synchronized videos.

Compared to Veo 3, it often displays “unknown language” when the prompt includes Chinese or other languages.

Longer duration & more video size options

Length: Veo 3 maxes out at about 8 seconds; Wan 2.5 supports up to 10 seconds, providing more space for storytelling.Formats: Veo 3 offers only one aspect ratio option, while Wan 2.5 supports three different video sizes to accommodate popular platforms and scenarios, enhancing publishing flexibility.

Voice-driven reference & original sound video

Veo 3 does not support audio reference, limiting creators to silent clips or system-generated sound. In contrast, Wan 2.5 allows direct input of voice, sound effects, and background music, driving the video generation with precise audio cues.