GPT Image 1 Mini is a cost-efficient multimodal OpenAI model powered by GPT-5 that turns text or image prompts into high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.
ว่าง

$0.02ต่อครั้ง·~50 / $1

Epic fantasy battle scene. A colossal, fire-breathing red dragon is locked in combat with hundreds of tiny knights on a crumbling medieval castle bridge. Below, a vast army clashes on the battlefield under a stormy, dark sky lit by dragon fire. Matte painting, hyper-detailed, cinematic lighting.

A grand, panoramic view of an ancient Roman triumphal procession entering the city through a massive marble arch. Thousands of cheering citizens line the streets. Legions of soldiers march in perfect formation, followed by chariots and exotic animals. Bright sunny day, highly detailed, historical painting style.

The Rain Walker Prompt: A person walking alone in the rain with an umbrella, city lights reflected on wet pavement, semi-realistic cartoon style, cinematic lighting, shallow depth of field. Text on image: “Some roads are meant to be walked alone.” Text placement: Reflected across the puddles on the ground, slightly distorted like water ripples, blending naturally with reflections.

Concept art for an indie music festival poster. Abstract, colorful geometric shapes and sound waves overlapping a silhouette of a guitarist against a vibrant sunset sky. Retro, flat design illustration style, space for text.

A library where the bookshelves are made of flowing waterfalls, and tiny glowing fish swim between the books. Digital art, hyperrealistic.

A bustling Tokyo street scene at night, filled with neon signs and flying cars, rendered in a vibrant pixel art style. 16-bit.

A cat astronaut wearing a vintage diving helmet, floating in a sea of swirling coffee and cream nebula. Whimsical illustration.

A lone, tiny robot holding a single glowing flower sits sadly amidst the colossal, overgrown ruins of a forgotten city at dawn. Mist hangs heavy in the air. Cinematic, moody, digital painting.

Simple, clean vector logo concept for a cozy neighborhood cafe. A stylized, hand-drawn coffee cup emitting steam that forms into a subtle heart shape. Warm brown and cream color palette. Flat design, white background.

A majestic clockwork owl with intricate gears and glowing amber eyes, sitting on a pile of ancient books in a wizard's study. Steampunk illustration.

Close-up portrait of a beautiful young woman in a modern city at sunset, warm orange lighting reflecting on her face, semi-realistic cartoon style, detailed skin texture, soft hair strands, cinematic depth, 85 mm lens look, bokeh background. Text on image: “The city glows, and so do you.”
GPT Image 1 Mini is a cost-efficient multimodal generation model built on OpenAI’s GPT-5 architecture. It combines advanced language understanding with optimized image synthesis to generate high-quality visuals from natural text prompts — ideal for UI design, concept art, and creative visualization tasks.
🧠 GPT-5-powered Understanding Precisely interprets complex text prompts and stylistic descriptions to deliver coherent, context-aware visuals.
🎨 Efficient Image Generation Creates polished, high-fidelity images while maintaining low latency and minimal compute cost.
💡 Native Multimodality Handles both text and visual reasoning seamlessly for design, layout, and concept synthesis.
💰 Cost-Effective Performance Delivers professional-grade image generation at a fraction of standard model pricing.
🧩 Ideal for UI/UX Design Excels at generating clean, modern design concepts, including app interfaces, dashboards, and product layouts.
| Parameter | Description |
|---|---|
| prompt* | A text description of the desired image (e.g., “UI design inspiration for a travel app screen…”) |
UI design inspiration for a travel app screen. Shows beautiful cards with stunning photos of travel destinations (beach, mountain). Clean layout, vibrant gradients (blue/green), modern sans-serif font style. Minimalist, user-friendly.
| Metric | Price |
|---|---|
| Per image generated | $0.02 / image |
Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/openai/gpt-image-1-mini/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Gpt Image 1 Mini Text To Image below.
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/openai/gpt-image-1-mini/text-to-image" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $WAVESPEED_API_KEY" \
-d '{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
-H "Authorization: Bearer $WAVESPEED_API_KEY"
# When status is "completed", read the output from data.outputs[0].// npm install wavespeed
const WaveSpeed = require('wavespeed');
const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env
const result = await client.run("openai/gpt-image-1-mini/text-to-image", {
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"enable_sync_mode": false,
"enable_base64_output": false
});
console.log(result.outputs[0]); // → URL of the generated output# pip install wavespeed
import wavespeed
output = wavespeed.run(
"openai/gpt-image-1-mini/text-to-image",
{
"prompt": "A cinematic shot of a city at sunset, soft golden light",
"enable_sync_mode": false,
"enable_base64_output": false
}
)
print(output["outputs"][0]) # → URL of the generated outputGpt Image 1 Mini Text To Image is a OpenAI model for image generation, exposed as a REST API on WaveSpeedAI. GPT Image 1 Mini is a cost-efficient multimodal OpenAI model powered by GPT-5 that turns text or image prompts into high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.
POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/openai/openai-gpt-image-1-mini-text-to-image.
Gpt Image 1 Mini Text To Image starts at $0.020 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.
Key inputs: `prompt`, `enable_base64_output`, `enable_sync_mode`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/openai/openai-gpt-image-1-mini-text-to-image.
Average end-to-end generation time on WaveSpeedAI is around 57 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.
Commercial usage rights depend on the model's license, set by its provider (OpenAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.