
text-to-image
Idle

Your request will cost $0.07 per run.
For $1 you can run this model approximately 14 times.
One more thing::





Grok 2 Image turns a natural-language text prompt into vivid, realistic images.
It’s xAI’s flagship image generation model, tuned for marketing creatives, social posts, product visuals, concept art, and more.
In the API, you use the grok-2-image. A single request can generate multiple images, making it easy to explore variations on a single idea.
Photorealistic, high-fidelity imagery
Trained to produce detailed textures, convincing lighting, and sharp compositions that work well for ads, hero images, and product renders.
Strong prompt following
Optimized for following descriptive prompts closely, capturing objects, layouts, and styles specified in your text while minimizing “prompt drift.”
Flexible visual styles
Handles realistic photography, digital illustration, stylized artwork, and concept sketches, making it useful for storyboards, thumbnails, and creative exploration.
Multi-image generation in one shot
A single request can generate up to 10 JPG images, so you can explore multiple creative directions from one prompt.
Competitive per-image pricing
Images are billed per output image, keeping costs predictable for batch runs and A/B creative testing.
Prompt refinement under the hood
Before reaching the image model, your text prompt can be lightly revised by a chat model to improve clarity, often leading to more accurate results without extra work on your side.
Billing is based on the number of images generated.
Each image will cost $0.07.
Write your prompt
Send the generation job
Download or display the results
Output format:
Images are returned in JPG format.
Per-job limits:
Prompt tips:
Nano Banana Pro High-quality text-to-image generation from Google, suitable for product shots, concept art, and creative visuals.
Seedream v4.5 A versatile image generation model from ByteDance, tuned for detailed scenes, characters, and stylized compositions.
Kling Image O1 A flagship image model from Kwaivgi/Kuaishou’s Kling series, focused on sharp, high-fidelity visuals and strong prompt adherence.
Qwen Image An Alibaba Qwen-based generator hosted by WaveSpeedAI, delivering robust semantic understanding and reliable text-to-image rendering across diverse styles.