
Your request will cost $0.20 per run at the default 512P resolution.
For $10 you can run this model approximately 50 times at that resolution.
Kandinsky 5 Pro Text-to-Video is a production-ready text-to-video model that generates dynamic 5-second MP4 clips from a single prompt. It’s optimized for fast iteration and clean, prompt-faithful motion, with simple controls for resolution and aspect ratio.

- 5-second text-to-video generation: Turn a prompt into a complete short clip, ideal for rapid concept testing and social-ready outputs.
- Two resolution tiers: Choose 512P for faster, cheaper drafts or 1024P for sharper detail.
- Creator-friendly aspect ratios: Built-in framing for 3:2, 1:1, and 2:3 to match common feed and creative formats.
- Fast, stable inference: Designed for predictable performance in real-world pipelines and batch experimentation.

| Parameter | Description |
|---|---|
| prompt* | The text prompt describing subject, action, scene, and style. |
| resolution | Output resolution: 512P (default) or 1024P. |
| aspect_ratio | Output aspect ratio: 3:2 (default), 1:1, or 2:3. |
| duration | Fixed at 5 seconds. |
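
As a rough illustration of how the parameters above fit together, the sketch below assembles and validates a request payload in Python. The function name `build_payload` and the idea of sending the fields as a flat JSON-style dictionary are assumptions made for this example. The page does not document an endpoint or client library, so only the parameter names, allowed values, and defaults come from the table.

```python
from typing import Dict, Union

# Allowed values and defaults taken from the parameter table.
ALLOWED_RESOLUTIONS = ("512P", "1024P")
ALLOWED_ASPECT_RATIOS = ("3:2", "1:1", "2:3")
FIXED_DURATION_SECONDS = 5


def build_payload(
    prompt: str,
    resolution: str = "512P",
    aspect_ratio: str = "3:2",
) -> Dict[str, Union[str, int]]:
    """Build a request payload (hypothetical helper, not an official client)."""
    # prompt is the only required parameter.
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required and must not be empty")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")
    return {
        "prompt": prompt.strip(),
        "resolution": resolution,
        "aspect_ratio": aspect_ratio,
        # Duration is fixed; whether the service expects this field
        # in the request at all is an assumption.
        "duration": FIXED_DURATION_SECONDS,
    }


if __name__ == "__main__":
    print(build_payload("A paper boat drifting down a rainy street, cinematic"))
```

Validating on the client side before submitting avoids paying for a run that would be rejected or silently fall back to defaults.
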

All videos are 5 seconds, so the price per video is five times the per-second rate.

| Resolution | Price per second | Price per 5s video |
|---|---|---|
| 512P | $0.04 | $0.20 |
| 1024P | $0.12 | $0.60 |
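
The pricing arithmetic can be checked with a short Python sketch. The helper names `cost_per_video` and `runs_for_budget` are invented for this example; only the per-second rates and the fixed 5-second duration come from the table. `Decimal` is used instead of floats so that currency values stay exact.

```python
from decimal import Decimal
from typing import Union

# Per-second rates from the pricing table, in USD.
PRICE_PER_SECOND = {
    "512P": Decimal("0.04"),
    "1024P": Decimal("0.12"),
}
DURATION_SECONDS = 5  # every clip is 5 seconds long


def cost_per_video(resolution: str) -> Decimal:
    """Price of one 5-second clip at the given resolution."""
    return PRICE_PER_SECOND[resolution] * DURATION_SECONDS


def runs_for_budget(budget: Union[str, int, Decimal], resolution: str) -> int:
    """Number of whole clips that fit in a budget (hypothetical helper)."""
    # Convert via str so int and Decimal inputs are handled exactly.
    return int(Decimal(str(budget)) // cost_per_video(resolution))


if __name__ == "__main__":
    for res in PRICE_PER_SECOND:
        print(res, cost_per_video(res), runs_for_budget("10", res))
```

At 512P, a $10 budget covers 50 clips. At 1024P the same budget covers 16 clips, with $0.40 left over.
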