Giảm 50% mô hình Vidu Q3 & Q3 Pro · Chỉ trên WaveSpeedAI | 20/5 – 2/6
Grok Models

Grok Models

xAI's most advanced AI models with real-time knowledge and vision capabilities.

xAI's most advanced AI models with real-time knowledge and vision capabilities.

All models

10 models
x-ai/grok-imagine-image-quality/text-to-image
text-to-image

x-ai/grok-imagine-image-quality/text-to-image

xAI Grok Imagine Image Quality Text to Image is a fast AI image generation model that creates high-quality images from text prompts using a dedicated RunPod workflow. Ready-to-use REST inference API for creative design, concept art, marketing visuals, product mockups, social media content, and professional AI image generation workflows with simple integration, no coldstarts, and affordable pricing.

x-ai/grok-imagine-image-quality/edit
image-to-image

x-ai/grok-imagine-image-quality/edit

xAI Grok Imagine Image Quality Edit is a fast AI image editing model that edits and enhances images with high-quality visual output using a dedicated RunPod workflow. Ready-to-use REST inference API for photo retouching, creative image edits, product image enhancement, marketing assets, social media visuals, and professional AI image editing workflows with simple integration, no coldstarts, and affordable pricing.

x-ai/grok-2-image
text-to-image

x-ai/grok-2-image

Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social posts and concept art, it follows your instructions closely so you can go from idea to production-ready image with just one prompt. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-image/edit
image-to-image

x-ai/grok-imagine-image/edit

X-AI Grok Imagine Image enables precise image editing with xAI's Grok Imagine model. Transform and modify images using text prompts with AI-powered precision. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-image/text-to-image
text-to-image

x-ai/grok-imagine-image/text-to-image

X-AI Grok Imagine Image enables precise image editing with xAI's Grok Imagine model. Transform and modify images using text prompts with AI-powered precision. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-video/text-to-video
text-to-video

x-ai/grok-imagine-video/text-to-video

X-AI Grok Imagine Video generates videos from text descriptions using xAI's Grok Imagine Video model. Create high-quality videos with customizable duration, aspect ratio, and resolution. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-video/image-to-video
image-to-video

x-ai/grok-imagine-video/image-to-video

X-AI Grok Imagine Video transforms images into videos using xAI's Grok Imagine Video model. Animate still images with natural motion, scene continuity, and synchronized audio. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-video/edit-video
video-to-video

x-ai/grok-imagine-video/edit-video

X-AI Grok Imagine Video Edit enables video editing using xAI's Grok Imagine Video model. Transform and modify existing videos with text prompts for seamless AI-powered edits. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-video/video-extend
video-extend

x-ai/grok-imagine-video/video-extend

X-AI Grok Imagine Video Extend turns short clips into longer videos with smooth motion continuity and natural scene extension. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

x-ai/grok-imagine-video/reference-to-video
image-to-video

x-ai/grok-imagine-video/reference-to-video

X-AI Grok Imagine Video Reference-to-Video generates videos from multiple reference images with preserved identity, style, and scene composition. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Grok Models

xAI's Grok Imagine is a powerful suite of AI-native image and video generation models, offering full creative control from text-to-image, image editing, to multi-modal video generation. Built on xAI's frontier reasoning capabilities, Grok Imagine delivers exceptional prompt understanding, cinematic quality, and production-ready outputs.

🎬 Grok Imagine Video — Edit, Animate & Generate

Grok Imagine Video provides three specialized endpoints for video creation: generate from text, animate images, or transform existing videos with AI-powered editing.

  1. Grok Imagine Video Text-to-Video — Generate high-quality videos from text prompts with strong motion coherence and cinematic framing.
  2. x-ai/grok-imagine-video/text-to-video
  3. Grok Imagine Video Image-to-Video — Bring still images to life with natural, fluid motion while preserving subject identity and composition.
  4. x-ai/grok-imagine-video/image-to-video
  5. Grok Imagine Video Edit — Transform and remix existing videos with AI-powered editing for style transfer, scene modification, and creative effects.
  6. x-ai/grok-imagine-video/edit-video

🖼️ Grok Imagine Image — Create & Edit

Grok's image generation models deliver stunning visuals with exceptional prompt adherence and artistic versatility.

  1. Grok Imagine Image Text-to-Image — Generate detailed, photorealistic or stylized images from text with superior prompt understanding.
  2. x-ai/grok-imagine-image/text-to-image
  3. Grok Imagine Image Edit — Precisely edit and refine images with controlled modifications while maintaining visual consistency.
  4. x-ai/grok-imagine-image/edit
  5. Grok 2 Image — xAI's flagship text-to-image model with frontier-level quality and creative flexibility.
  6. x-ai/grok-2-image

✨ Highlights

  1. Frontier Prompt Understanding: Powered by Grok's advanced reasoning for exceptional text comprehension and creative interpretation.
  2. Cinematic Video Quality: Smooth motion, consistent subjects, and professional-grade output.
  3. Versatile Image Generation: From photorealistic to artistic styles with precise control.
  4. Full Creative Pipeline: Text-to-image, image editing, and multi-modal video generation in one unified suite.
  5. Production-Ready: Fast inference with reliable, consistent results for commercial workflows.

Grok Models API — pricing & performance

Run any model in the Grok Models collection through a single REST API. Pay per generation — no subscriptions, no minimums — with industry-leading latency on a 99.9% uptime infrastructure.

Why run Grok Models on WaveSpeedAI

Transparent pricing

Per-call pricing for every Grok Models model. The price is listed on each model page — no platform fees on top.

Optimized for low latency

Most Grok Models image models complete in under 2 seconds. Video and 3D models run several times faster than self-hosted alternatives.

99.9% uptime

Multi-region failover and automatic retries keep your production traffic online — even during provider outages.

Frequently asked questions

How much does the Grok Models API cost?+

Each model has its own per-call price listed on the model page. We bill per successful generation, with no subscription fees or minimums.

How fast are Grok Models models on WaveSpeedAI?+

Image models in this collection typically complete in under 2 seconds. Video and 3D models depend on duration and resolution but are usually several times faster than self-hosted runs.

Can I try the API without a credit card?+

Có — mỗi tài khoản nhận $1 tín dụng miễn phí khi đăng ký, đủ để thử hầu hết các mô hình Grok Models mà không cần thẻ tín dụng.

Are there rate limits?+

Standard accounts have generous concurrent-job limits. Enterprise plans offer custom RPM, higher concurrency, and dedicated capacity — contact sales for details.