X-AI Grok Imagine Video

x-ai/grok-imagine-video/text-to-video

X-AI Grok Imagine Video generates videos from text descriptions using xAI's Grok Imagine Video model. Create high-quality videos with customizable duration, aspect ratio, and resolution. The model is available through a ready-to-use REST inference API with no cold starts and per-second pricing.

Grok Imagine Video Text-to-Video

Grok Imagine Video Text-to-Video is X-AI's text-to-video generation model that creates videos directly from text descriptions. Describe the scene, motion, and style you want — the model generates cinematic footage with realistic movement and atmosphere.

Why Choose This?

  • Pure text-driven generation: Create videos from scratch using only text descriptions.
  • Flexible duration: Generate videos of varying lengths based on your needs.
  • Multiple aspect ratios: Support for 16:9, 9:16, and other common formats.
  • Resolution options: Output in 480p or 720p based on your requirements.
  • Prompt Enhancer: Built-in tool to automatically improve your video descriptions.

Parameters

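| Parameter    | Required | Description                                     |
|--------------|----------|-------------------------------------------------|
| prompt       | Yes      | Text description of the video scene and motion  |
| duration     | No       | Video length in seconds (default: 6)            |
| aspect_ratio | No       | Output ratio: 16:9, 9:16, etc.                  |
| resolution   | No       | Output resolution: 720p (default), 480p         |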
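
As a rough illustration of how these parameters fit together, the sketch below builds a request body from the four fields in the table. The function name `build_payload` and its validation rules are assumptions made for this example, not part of the documented API; only the parameter names and the listed resolutions come from the table.

```python
from typing import Any, Dict, Optional

# Resolutions listed in the parameter table.
ALLOWED_RESOLUTIONS = {"480p", "720p"}


def build_payload(
    prompt: str,
    duration: Optional[int] = None,
    aspect_ratio: Optional[str] = None,
    resolution: Optional[str] = None,
) -> Dict[str, Any]:
    """Build a request body, leaving out optional fields that were not set.

    Hypothetical helper: the checks below are illustrative, not the
    service's own validation.
    """
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if duration is not None and duration <= 0:
        raise ValueError("duration must be a positive number of seconds")
    if resolution is not None and resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")

    payload: Dict[str, Any] = {"prompt": prompt.strip()}
    # Fields that are omitted fall back to the documented defaults (6 s, 720p).
    for key, value in (
        ("duration", duration),
        ("aspect_ratio", aspect_ratio),
        ("resolution", resolution),
    ):
        if value is not None:
            payload[key] = value
    return payload
```

Leaving unset fields out of the body, rather than hard-coding defaults on the client, keeps the example from drifting if the service changes its defaults.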

How to Use

  1. Write your prompt — describe the scene, motion, camera movement, and atmosphere in detail.
  2. Set duration — choose how long the video should be.
  3. Select aspect ratio — 16:9 for landscape, 9:16 for vertical content.
  4. Select resolution — 720p for quality, 480p for faster processing.
  5. Run — submit and download your video.
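
The steps above could be scripted roughly as follows. This is a minimal sketch: the base URL is a placeholder, and the Bearer-token header is an assumed authentication scheme, so check both against the API documentation before use. Only the model ID and the parameter names are taken from this page. The request is prepared but deliberately not sent.

```python
import json
import urllib.request

MODEL_ID = "x-ai/grok-imagine-video/text-to-video"  # model ID shown on this page
# Placeholder base URL (assumption): replace with the endpoint from the API docs.
API_BASE = "https://api.example.com/v1"


def prepare_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Return a POST request carrying the payload as a JSON body."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{MODEL_ID}",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


request = prepare_request(
    {
        "prompt": "A red fox trotting through fresh snow at dawn, slow tracking shot",
        "duration": 6,
        "aspect_ratio": "16:9",
        "resolution": "720p",
    },
    api_key="YOUR_API_KEY",
)
# Sending is left out on purpose; urllib.request.urlopen(request) would submit it.
```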

Pricing

| Duration   | Cost  |
|------------|-------|
| Per second | $0.05 |

Billing Rules

  • Total cost = $0.05 × duration (in seconds)

Examples

  • 5s video → $0.25
  • 6s video → $0.30
  • 10s video → $0.50
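
The billing rule can be checked with a few lines of code. The sketch below uses `Decimal` so that the amounts stay exact; the function name `estimate_cost` is made up for this example.

```python
from decimal import Decimal

PRICE_PER_SECOND = Decimal("0.05")  # USD per second, from the pricing table


def estimate_cost(duration_seconds: int) -> Decimal:
    """Total cost = $0.05 x duration (in seconds)."""
    if duration_seconds <= 0:
        raise ValueError("duration must be a positive number of seconds")
    return PRICE_PER_SECOND * duration_seconds


for seconds in (5, 6, 10):
    print(f"{seconds}s video -> ${estimate_cost(seconds)}")
# prints:
# 5s video -> $0.25
# 6s video -> $0.30
# 10s video -> $0.50
```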

Best Use Cases

  • Social Media Content — Generate short-form videos for TikTok, Reels, and Stories.
  • Concept Visualization — Bring ideas to life without filming or stock footage.
  • Marketing & Ads — Create promotional video content from descriptions.
  • Storytelling — Generate narrative scenes for creative projects.
  • Rapid Prototyping — Test video concepts before full production.

Pro Tips

  • Use the Prompt Enhancer to refine your descriptions automatically.
  • Be specific about camera movement (handheld, dolly, pan, zoom) for dynamic footage.
  • Describe lighting, weather, and atmosphere for more realistic results.
  • Match aspect ratio to your target platform: 16:9 for YouTube, 9:16 for TikTok/Reels.
  • Include action details and timing for more controlled motion.
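
One way to apply these tips consistently is to assemble the prompt from named parts and look up the aspect ratio by platform. The helper below is a hypothetical sketch: `compose_prompt` and `PLATFORM_ASPECT_RATIO` are names invented for this example, and the mapping only covers the platforms mentioned in the tips.

```python
# Platform-to-ratio pairs taken from the tips above; extend as needed.
PLATFORM_ASPECT_RATIO = {"youtube": "16:9", "tiktok": "9:16", "reels": "9:16"}


def compose_prompt(scene: str, camera: str = "", lighting: str = "", action: str = "") -> str:
    """Join the non-empty parts into one prompt, one sentence per part."""
    if not scene.strip():
        raise ValueError("scene description is required")
    parts = [scene, camera, lighting, action]
    return ". ".join(p.strip().rstrip(".") for p in parts if p.strip()) + "."


prompt = compose_prompt(
    scene="A lighthouse on a rocky cliff",
    camera="Slow dolly-in from the sea",
    lighting="Stormy dusk light with heavy rain",
    action="Waves crash against the rocks as the lamp sweeps left to right",
)
aspect_ratio = PLATFORM_ASPECT_RATIO["tiktok"]
```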

Notes

  • Only prompt is required; other parameters have defaults.
  • Ensure uploaded URLs are publicly accessible if referencing external content.
  • For best results, write detailed prompts with scene, motion, and style information.
