Minimax Music 2.5Vidu Contest
WaveSpeed.ai
Home/Explore/Vidu Models/vidu/text-to-video
text-to-video

text-to-video

Vidu Text To Video

vidu/text-to-video

Vidu Text to Video converts text prompts into high-quality 720p videos with exceptional visual fidelity and diverse motion dynamics. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input

Idle

Your request will cost $0.2 per run.

For $10 you can run this model approximately 50 times.

One more thing:

ExamplesView all

README

๐ŸŽฌ Vidu Text-to-Video

Vidu Text-to-Video transforms your text prompts into high-quality, cinematic 720p videos โ€” complete with expressive motion, dynamic lighting, and natural camera movement. Built for creators, storytellers, and developers, Vidu delivers smooth, detailed, and visually coherent motion sequences directly from text input.

๐ŸŒŸ Why it looks amazing

  • Cinematic generation: produces film-like shots with realistic motion and depth of field.
  • High temporal consistency: ensures clean transitions with minimal flicker or distortion.
  • Motion diversity: supports subtle gestures, expressive movement, and camera motion for natural flow.
  • Text-guided direction: interprets complex scene descriptions with strong semantic alignment.
  • 720p output: optimized for crisp detail and creative flexibility.

โš™๏ธ Parameters and Controls

  • prompt โ€” describe your scene (e.g., โ€œA cat walking through a neon-lit alley at nightโ€).

  • movement_amplitude โ€” control motion strength:

    • auto (default): model decides best motion level.
    • small: subtle, gentle movements (good for portraits or still scenes).
    • medium: balanced camera and subject motion.
    • large: cinematic, dramatic, or action-heavy movement.
  • seed โ€” set for reproducible results (leave blank for random).

๐Ÿ’ฐ Pricing

ResolutionDurationCost per Clip
720p4s$0.20

๐Ÿš€ How to Use

  1. โœ๏ธ Write your prompt describing the scene or action.
  2. ๐ŸŽฅ Choose your movement amplitude (auto, small, medium, or large).
  3. ๐ŸŽฒ Optionally set a seed for reproducibility.
  4. โ–ถ๏ธ Click Run ($0.20) to generate your video.

๐Ÿ’ก Pro Tips

  • Use vivid verbs and adjectives for better motion understanding.
  • For subtle portraits, pick small amplitude; for dynamic shots, choose large.
  • Keep your prompt concise but descriptive โ€” focus on subject, setting, and style.
  • Adjust seed values to explore different visual interpretations of the same prompt.