Wan 2.1 T2V Plus 720P API

ExamplesView all

Related Models

wan-2.1/v2v-720p-lora

lora-support

wan-2.1/v2v-720p-ultra-fast

video-to-video

wan-2.1/v2v-480p-ultra-fast

video-to-video

wan-2.1/v2v-720p

video-to-video

wan-2.1/v2v-480p-lora-ultra-fast

lora-support

wan-2.1/v2v-720p-lora-ultra-fast

lora-support

README

WAN 2.1 — Text-to-Video Plus Model (720p)

WAN 2.1 T2V Plus is an advanced text-to-video generation model powered by Cloud’s Mixture of Experts (MoE) architecture. It creates cinematic 5-second 720p videos with natural motion, balanced lighting, and smooth transitions — optimized for speed, stability, and storytelling flexibility.

Why it looks great

Cinematic control: captures lighting, color tone, and depth for professional-grade visuals.
Smooth temporal motion: ensures coherent motion flow between subjects and background.
Prompt accuracy: delivers faithful interpretation of detailed text descriptions.
Optimized 720p efficiency: achieves excellent quality at faster inference and lower cost.
Stable rendering: minimizes flicker, distortion, or structure shifts during animation.

Pricing

Duration	Resolution	Cost per job
5 s	720p	$0.70

How to Use

Write Prompt – describe the desired scene, environment, and camera movement.
Choose Size – select landscape (1280×720) or portrait (720×1280).
(Optional) Add a Negative Prompt to exclude unwanted elements.
(Optional) Set Seed for reproducibility.
Run – preview and download your generated 5-second clip.

Pro Tips

Include motion cues (e.g., “camera panning,” “soft breeze,” “car moving through city lights”).
Use portrait mode for short-form social content, landscape for cinematic or presentation use.
Keep prompts focused and clear for best visual alignment and stable motion.

Notes

Please verify that your prompt and parameters are set correctly before running.
If results appear inconsistent, try a new seed value or simplify your prompt.