Openai Gpt Image 1 Text To Image
Playground
Try it on WavespeedAI!OpenAI GPT Image-1 generates images from text prompts from OpenAI’s latest text-to-image model, ideal for creating visual assets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
Features
OpenAI GPT Image 1
GPT Image 1 is OpenAI’s latest multimodal image generation model, built to understand both text and image inputs and produce visually coherent, high-quality image outputs. It combines the reasoning power of GPT-4-Turbo with DALL·E-class visual synthesis—allowing for creative, controllable, and context-aware generation across illustration, photography, design, and visualization tasks.
🧠 Key Features
-
Multimodal Understanding Accepts both text and image inputs, enabling style transfer, editing, or contextual composition.
-
Flexible Styles Produces photorealistic renders, stylized artwork, concept art, infographics, and 3D-style illustrations.
-
High Visual Fidelity Maintains object relationships, lighting consistency, and color balance with strong adherence to prompts.
-
Accurate Text Rendering Capable of generating clean typography—ideal for posters, memes, comics, and branding visuals.
-
Knowledge-Grounded Creativity Uses GPT-4’s world knowledge to generate factual, contextually appropriate visuals.
⚙️ Parameters
- Prompt: Required text description of the desired image.
- Size: Supports
1024×1024,1024×1536, and1536×1024. - Quality: Choose between
low,medium, andhigh.
💰 Pricing
| Resolution | Low ($) | Medium ($) | High ($) |
|---|---|---|---|
| 1024 × 1024 | 0.011 | 0.042 | 0.167 |
| 1024 × 1536 / 1536 × 1024 | 0.016 | 0.063 | 0.250 |
💡 Tips for Best Results
-
Write prompts that specify style, subject, composition, and lighting.
Example: “A small robot exploring an abandoned city, cartoon style, bright colors.”
-
Use high quality for detailed or large-format outputs.
-
Prefer landscape (1536×1024) for cinematic or wide compositions, and portrait (1024×1536) for characters or vertical art.
📝 Notes
- All generated content follows OpenAI’s safety and content policies.
- If a prompt triggers moderation, rephrase or simplify it.
- This model supports multi-image input via API, enabling creative editing and composition workflows.
- For performance and latency-sensitive cases, use medium quality as the balanced default.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/openai/gpt-image-1/text-to-image" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"size": "1024*1024",
"quality": "medium",
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| size | string | No | 1024*1024 | 1024*1024, 1024*1536, 1536*1024 | The size of the generated media in pixels (width*height). |
| quality | string | No | medium | low, medium, high | The quality of the generated image. |
| enable_sync_mode | boolean | No | false | - | If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. |
| enable_base64_output | boolean | No | false | - | If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |