
text-to-video
Idle
Your request will cost $0.5 per run.
For $10 you can run this model approximately 20 times.
One more thing::
WAN 2.6 Text-to-Video is Alibaba’s WanXiang 2.6 model that turns a pure text prompt (optionally with audio) into a 5–15s cinematic clip. It supports multi-shot storytelling, vertical or landscape formats, and resolutions up to 1080p, making it a strong fit for ads, trailers, and social content.
prompt* – Main description of the video: scene, characters, motion, camera moves, style.
negative_prompt – Things to avoid (e.g. watermark, text, distortion, extra limbs).
audio (optional) – URL or file of an audio track; reserved for advanced workflows where you want to align motion with existing sound.
size – Resolution presets:
720p tier
1080p tier
duration – One of 5s, 10s, 15s.
shot_type –
enable_prompt_expansion – If enabled, WAN 2.6 first expands your prompt into an internal, more detailed script before generating.
seed – Random seed; set to -1 for different results each time or use a fixed integer for reproducible motion/layout.
Output: an MP4 video at the chosen resolution and orientation.
Pricing depends on duration and resolution tier:
| Resolution | 5 s | 10 s | 15 s |
|---|---|---|---|
| 720p | $0.50 | $1.00 | $1.50 |
| 1080p | $0.75 | $1.50 | $2.25 |
-1 for variation) and click Run to generate your clip.kwaivgi/kling-video-o1/text-to-video Kwaivgi’s cinematic text-to-video model, great for character-driven scenes, smooth camera moves, and short-form storytelling.
alibaba/wan-2.5/text-to-video Alibaba’s WAN 2.5 prompt-to-video engine, focused on fast, coherent ads, explainers, and product demos.
google/veo3.1/text-to-video Google Veo 3.1 text-to-video, tuned for crisp compositions, filmic motion, and marketing-ready visuals.
openai/sora-2/text-to-video OpenAI Sora 2, a high-end text-to-video generator for long, detailed, physics-aware scenes and premium creative content.