Grok Imagine Video Text-to-Video
Grok Imagine Video Text-to-Video is X-AI's text-to-video generation model that creates videos directly from text descriptions. Describe the scene, motion, and style you want — the model generates cinematic footage with realistic movement and atmosphere.
Why Choose This?
-
Pure text-driven generation
Create videos from scratch using only text descriptions.
-
Flexible duration
Generate videos of varying lengths based on your needs.
-
Multiple aspect ratios
Support for 16:9, 9:16, and other common formats.
-
Resolution options
Output in 480p or 720p based on your requirements.
-
Prompt Enhancer
Built-in tool to automatically improve your video descriptions.
Parameters
| Parameter | Required | Description |
|---|
| prompt | Yes | Text description of the video scene and motion |
| duration | No | Video length in seconds (default: 6) |
| aspect_ratio | No | Output ratio: 16:9, 9:16, etc. |
| resolution | No | Output resolution: 720p (default), 480p |
How to Use
- Write your prompt — describe the scene, motion, camera movement, and atmosphere in detail.
- Set duration — choose how long the video should be.
- Select aspect ratio — 16:9 for landscape, 9:16 for vertical content.
- Select resolution — 720p for quality, 480p for faster processing.
- Run — submit and download your video.
Pricing
| Duration | Cost |
|---|
| Per second | $0.05 |
Billing Rules
- Total cost = $0.05 × duration (in seconds)
Examples
- 5s video → $0.25
- 6s video → $0.30
- 10s video → $0.50
Best Use Cases
- Social Media Content — Generate short-form videos for TikTok, Reels, and Stories.
- Concept Visualization — Bring ideas to life without filming or stock footage.
- Marketing & Ads — Create promotional video content from descriptions.
- Storytelling — Generate narrative scenes for creative projects.
- Rapid Prototyping — Test video concepts before full production.
Pro Tips
- Use the Prompt Enhancer to refine your descriptions automatically.
- Be specific about camera movement (handheld, dolly, pan, zoom) for dynamic footage.
- Describe lighting, weather, and atmosphere for more realistic results.
- Match aspect ratio to your target platform: 16:9 for YouTube, 9:16 for TikTok/Reels.
- Include action details and timing for more controlled motion.
Notes
- Only prompt is required; other parameters have defaults.
- Ensure uploaded URLs are publicly accessible if referencing external content.
- For best results, write detailed prompts with scene, motion, and style information.
Related Models
- Grok Imagine Video Image-to-Video — Generate video from reference images.
- Grok Imagine Video Edit — Edit existing videos with text instructions.
- Grok Imagine Image Text-to-Image — Generate images from text prompts.