
text-to-image
Idle

이 요청에는 $0.042 실행당가 필요합니다.
$1으로 이 모델을 약 23회 실행할 수 있습니다.
추가 안내:














GPT Image 1 is OpenAI’s latest multimodal image generation model, built to understand both text and image inputs and produce visually coherent, high-quality image outputs. It combines the reasoning power of GPT-4-Turbo with DALL·E-class visual synthesis—allowing for creative, controllable, and context-aware generation across illustration, photography, design, and visualization tasks.
Multimodal Understanding Accepts both text and image inputs, enabling style transfer, editing, or contextual composition.
Flexible Styles Produces photorealistic renders, stylized artwork, concept art, infographics, and 3D-style illustrations.
High Visual Fidelity Maintains object relationships, lighting consistency, and color balance with strong adherence to prompts.
Accurate Text Rendering Capable of generating clean typography—ideal for posters, memes, comics, and branding visuals.
Knowledge-Grounded Creativity Uses GPT-4’s world knowledge to generate factual, contextually appropriate visuals.
1024×1024, 1024×1536, and 1536×1024.low, medium, and high.| Resolution | Low ($) | Medium ($) | High ($) |
|---|---|---|---|
| 1024 × 1024 | 0.011 | 0.042 | 0.167 |
| 1024 × 1536 / 1536 × 1024 | 0.016 | 0.063 | 0.250 |
Write prompts that specify style, subject, composition, and lighting.
Example: “A small robot exploring an abandoned city, cartoon style, bright colors.”
Use high quality for detailed or large-format outputs.
Prefer landscape (1536×1024) for cinematic or wide compositions, and portrait (1024×1536) for characters or vertical art.