
text-to-image

Your request will cost $0.042 per run.
For $1 you can run this model approximately 23 times.

GPT Image 1 is OpenAI’s latest multimodal image generation model. It accepts both text and image inputs and produces visually coherent, high-quality images. It combines the reasoning power of GPT-4-Turbo with DALL·E-class visual synthesis, which allows creative, controllable, and context-aware generation across illustration, photography, design, and visualization tasks.
Multimodal Understanding: Accepts both text and image inputs, enabling style transfer, editing, or contextual composition.
Flexible Styles: Produces photorealistic renders, stylized artwork, concept art, infographics, and 3D-style illustrations.
High Visual Fidelity: Maintains object relationships, lighting consistency, and color balance with strong adherence to prompts.
Accurate Text Rendering: Capable of generating clean typography, which suits posters, memes, comics, and branding visuals.
Knowledge-Grounded Creativity: Uses GPT-4’s world knowledge to generate factual, contextually appropriate visuals.
Supported resolutions: 1024×1024, 1024×1536, and 1536×1024.
Quality levels: low, medium, and high.

| Resolution | Low ($) | Medium ($) | High ($) |
|---|---|---|---|
| 1024 × 1024 | 0.011 | 0.042 | 0.167 |
| 1024 × 1536 / 1536 × 1024 | 0.016 | 0.063 | 0.250 |
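The pricing table lends itself to a small lookup for estimating spend before a batch of runs. The following is a minimal sketch, not part of any official SDK: the names `PRICES_USD`, `price_per_image`, and `runs_for_budget` are hypothetical, and the size strings written with a plain `x` are an assumed convention. The prices themselves are copied from the table.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the pricing table.
# Keys ("1024x1024", "low", ...) are an assumed naming convention.
PRICES_USD = {
    "1024x1024": {"low": Decimal("0.011"), "medium": Decimal("0.042"), "high": Decimal("0.167")},
    "1024x1536": {"low": Decimal("0.016"), "medium": Decimal("0.063"), "high": Decimal("0.250")},
    "1536x1024": {"low": Decimal("0.016"), "medium": Decimal("0.063"), "high": Decimal("0.250")},
}


def price_per_image(size: str, quality: str) -> Decimal:
    """Return the listed price for one image at the given size and quality."""
    try:
        return PRICES_USD[size][quality]
    except KeyError:
        raise ValueError(f"unsupported size/quality: {size!r}, {quality!r}") from None


def runs_for_budget(budget_usd, size: str, quality: str) -> int:
    """Return how many whole images fit in the budget (rounded down)."""
    budget = Decimal(str(budget_usd))
    return int(budget // price_per_image(size, quality))


if __name__ == "__main__":
    print(price_per_image("1024x1024", "medium"))
    print(runs_for_budget(1, "1024x1024", "medium"))
```

`Decimal` is used instead of floats so that the division rounds down on exact decimal values rather than on binary approximations of the prices.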
Write prompts that specify style, subject, composition, and lighting.
Example: “A small robot exploring an abandoned city, cartoon style, bright colors.”
Use the high quality setting for detailed or large-format outputs; per the pricing table, it costs roughly four times as much as medium.
Prefer landscape (1536×1024) for cinematic or wide compositions, and portrait (1024×1536) for characters or vertical art.
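The prompting and sizing tips above can be captured in a pair of helpers that assemble a prompt and pick a resolution by orientation. This is a sketch under stated assumptions: `build_prompt` and `build_request` are hypothetical helpers, and the dictionary keys `prompt`, `size`, and `quality` are assumed parameter names, so check them against the API you actually call. No network request is made.

```python
# Orientation-to-resolution mapping taken from the supported sizes.
SIZES = {
    "square": "1024x1024",
    "portrait": "1024x1536",   # characters or vertical art
    "landscape": "1536x1024",  # cinematic or wide compositions
}
QUALITIES = ("low", "medium", "high")


def build_prompt(subject: str, style: str = "", composition: str = "", lighting: str = "") -> str:
    """Join the non-empty prompt components into one comma-separated prompt."""
    parts = (subject, style, composition, lighting)
    return ", ".join(p.strip() for p in parts if p and p.strip())


def build_request(prompt: str, orientation: str = "square", quality: str = "medium") -> dict:
    """Return a parameter dict (assumed key names) after validating the inputs."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if orientation not in SIZES:
        raise ValueError(f"orientation must be one of {sorted(SIZES)}")
    if quality not in QUALITIES:
        raise ValueError(f"quality must be one of {QUALITIES}")
    return {"prompt": prompt, "size": SIZES[orientation], "quality": quality}


if __name__ == "__main__":
    prompt = build_prompt(
        subject="A small robot exploring an abandoned city",
        style="cartoon style",
        lighting="bright colors",
    )
    print(build_request(prompt, orientation="landscape", quality="high"))
```

Validating orientation and quality up front keeps a typo from turning into a billed request at an unintended setting.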