Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social posts and concept art, it follows your instructions closely so you can go from idea to production-ready image with just one prompt. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Idle

$0.07per run·~14 / $1

Cinematic aerial shot of a colossal biomechanical city-ship drifting through a nebula at golden hour, intricate organic-metallic architecture covered in bioluminescent veins, massive translucent wings made of light, thousands of tiny ships swarming like fireflies, warm rim lighting against cold cosmic background, shot on 65mm IMAX film, anamorphic lens flares, insane detail, photorealistic, 8K

Hyper-realistic classical oil painting portrait of a 24-year-old East Asian woman with porcelain skin and subtle freckles, wearing 18th century European aristocratic attire with intricate lace and pearls, soft Rembrandt lighting, dramatic chiaroscuro, individual strands of hair, micro skin texture, in the style of John Singer Sargent and Bouguereau, museum quality, 8K

Cinematic aerial view of a post-apocalyptic Tokyo at sunrise, overgrown with massive glowing cherry blossom trees that emit pink particles, abandoned Shibuya crossing completely covered in petals, giant broken holographic billboards still flickering, golden rays piercing through thick fog, thousands of crows flying overhead, ultra-realistic, shot on 70mm IMAX, anamorphic lens flares, emotional masterpiece, 8K

Breathtaking underwater scene of an ancient sunken cyberpunk city, massive skyscrapers covered in coral and bioluminescent algae, schools of whales swimming between buildings, rays of sunlight piercing through the surface creating god rays, abandoned mecha lying on the seabed, ethereal beauty, hyper-realistic, national geographic style meets blade runner 2049, 8K

Dramatic panoramic view of Shanghai Bund 200 years after apocalypse, iconic skyline completely overtaken by massive glowing mushrooms and vines, Oriental Pearl Tower wrapped in bioluminescent flora, aurora borealis in the sky, abandoned ships floating in the Huangpu River covered in moss, lone figure standing on the bund, emotional and hauntingly beautiful, hyper-realistic, 8K
Grok 2 Image turns a natural-language text prompt into vivid, realistic images. It’s xAI’s flagship image generation model, tuned for marketing creatives, social posts, product visuals, concept art, and more.
In the API, you use the grok-2-image. A single request can generate multiple images, making it easy to explore variations on a single idea.
Photorealistic, high-fidelity imagery Trained to produce detailed textures, convincing lighting, and sharp compositions that work well for ads, hero images, and product renders.
Strong prompt following Optimized for following descriptive prompts closely, capturing objects, layouts, and styles specified in your text while minimizing “prompt drift.”
Flexible visual styles Handles realistic photography, digital illustration, stylized artwork, and concept sketches, making it useful for storyboards, thumbnails, and creative exploration.
Multi-image generation in one shot A single request can generate up to 10 JPG images, so you can explore multiple creative directions from one prompt.
Competitive per-image pricing Images are billed per output image, keeping costs predictable for batch runs and A/B creative testing.
Prompt refinement under the hood Before reaching the image model, your text prompt can be lightly revised by a chat model to improve clarity, often leading to more accurate results without extra work on your side.
Billing is based on the number of images generated.
Each image will cost $0.07.
Output format: Images are returned in JPG format.
Per-job limits:
Up to 10 images per request
Additional throughput limits depend on your account/plan.
Prompt tips:
Be concrete about objects, layout, and style (e.g., “centered product on plain background”).
Avoid contradictory instructions in a single prompt.
Iterate: start simple, then gradually add details once you like the base composition.
Nano Banana Pro High-quality text-to-image generation from Google, suitable for product shots, concept art, and creative visuals.
Seedream v4.5 A versatile image generation model from, tuned for detailed scenes, characters, and stylized compositions.
Kling Image O1 A flagship image model from Kwaivgi/Kuaishou’s Kling series, focused on sharp, high-fidelity visuals and strong prompt adherence.
Qwen Image An Qwen-based generator hosted by WaveSpeedAI, delivering robust semantic understanding and reliable text-to-image rendering across diverse styles.
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/x-ai/grok-2-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Grok 2 Image below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/x-ai/grok-2-image" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"num_images": 1,
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("x-ai/grok-2-image", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"num_images": 1,
"enable_sync_mode": false,
"enable_base64_output": false
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"x-ai/grok-2-image",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"num_images": 1,
"enable_sync_mode": false,
"enable_base64_output": false
}
)
print(output["outputs"][0]) # → URL of the generated outputGrok 2 Image is a xAI model for image generation, exposed as a REST API on WaveSpeedAI. Grok 2 Image is xAI’s latest image generation model that turns simple text prompts into sharp, photorealistic visuals in seconds. From product shots to social posts and concept art, it follows your instructions closely so you can go from idea to production-ready image with just one prompt. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/x-ai/x-ai-grok-2-image.
Grok 2 Image starts at $0.070 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `enable_base64_output`, `enable_sync_mode`, `num_images`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/x-ai/x-ai-grok-2-image.
Average end-to-end generation time on WaveSpeedAI is around 13 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (xAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.