ลด 50% โมเดล Vidu Q3 และ Q3 Pro · เฉพาะที่ WaveSpeedAI | 20 พ.ค. – 2 มิ.ย.

Qwen Image Text to Image 2512 API

wavespeed-ai /

Qwen Image 2512 is Qwen's latest text-to-image model with enhanced prompt understanding, superior text rendering, and versatile aspect ratio support. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

text-to-image
อินพุต
width
height
1024 × 1024 px
Range: 256 - 1536
If set to true, the function will wait for the result to be generated and uploaded before returning the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

ว่าง

A 25-year-old woman with flowing auburn hair, captured in golden hour light streaming through venetian blinds, creating dramatic shadow patterns across her face. Shot on Hasselblad H6D-100c, 85mm f/1.4 lens, shallow depth of field, film grain, Kodak Portra 400 color science. Cinematic composition with negative space, melancholic atmosphere.

$0.02ต่อครั้ง·~50 / $1

ต่อไป:

ตัวอย่างดูทั้งหมด

A 25-year-old woman with flowing auburn hair, captured in golden hour light streaming through venetian blinds, creating dramatic shadow patterns across her face. Shot on Hasselblad H6D-100c, 85mm f/1.4 lens, shallow depth of field, film grain, Kodak Portra 400 color science. Cinematic composition with negative space, melancholic atmosphere.
Abandoned Art Deco cinema interior, dust particles floating in shafts of light from broken skylights, ornate ceiling details crumbling, velvet seats covered in decades of debris. Wide angle lens distortion, HDR dynamic range, mysterious atmosphere, urbex photography aesthetic.
A colossal ancient tree growing through a collapsed cathedral, roots wrapped around gothic pillars, bioluminescent fungi illuminating the darkness, tiny floating spirits drifting upward like embers. Painted in the style of Craig Mullins meets Hayao Miyazaki, rich atmospheric perspective, matte painting quality, 8K resolution, epic scale.
Venice canal scene in the style of John Singer Sargent, loose impressionistic brushwork capturing light dancing on water, gondolas in soft focus background, palazzo facades in warm terracotta and faded ochre. Oil painting texture, visible canvas weave, museum quality reproduction, gilt frame crop.
Lone motorcyclist stopped at abandoned gas station at dusk, removing helmet to reveal weathered face and grey hair, wanted poster with her younger face peeling off the wall behind her, dust storm approaching on the horizon. Coen Brothers Americana, Cormac McCarthy desolation, you can hear the silence, she's been running for decades.

โมเดลที่เกี่ยวข้อง

README

Qwen Image 2512

Qwen Image 2512 is latest text-to-image generation model from the Qwen AI family. It excels at understanding natural language prompts and producing high-quality images with exceptional text rendering capabilities — perfect for creating posters, signage, logos, and designs requiring readable text.

Why Choose This?

  • Superior text rendering Accurately generates legible text within images, including multiple languages, fonts, and layouts. Ideal for designs requiring readable text elements.

  • Enhanced prompt understanding Interprets complex, detailed prompts with better comprehension of subject relationships, spatial arrangements, and stylistic nuances.

  • Flexible sizing Supports custom width and height configurations for various use cases — social media, presentations, print, and web content.

  • Consistent quality across styles Produces high-quality results whether you're creating photorealistic images, illustrations, concept art, or abstract designs.

  • Prompt Enhancer Built-in tool to automatically improve your prompts for better generation results.

Parameters

ParameterRequiredDescription
promptYesDescribe the image you want to create
widthNoImage width in pixels (default: 1024)
heightNoImage height in pixels (default: 1024)
seedNoRandom seed for reproducible results (-1 for random)
output_formatNoOutput format: jpeg, png, or webp

Output Format Options

  • jpeg — Smaller file size, good for photos and web use
  • png — Lossless quality, supports transparency, best for graphics with text
  • webp — Modern format with better compression, good browser support

How to Use

  1. Write your prompt — describe the image you want, including style, composition, lighting, and mood.
  2. Adjust size — set width and height for your desired dimensions.
  3. Set seed — use -1 for random results, or specify a number for reproducibility.
  4. Choose output format — select jpeg, png, or webp based on your needs.
  5. Run — click Run, preview the result, and iterate if needed.

Pricing

ItemCost
Per image$0.02

Simple flat-rate pricing regardless of image size.

Best Use Cases

  • Marketing and Advertising — Create eye-catching visuals with text for ads, posters, and promotional materials.
  • Social Media Content — Generate engaging images optimized for different platform formats.
  • Product Design — Visualize concepts, mockups, and packaging designs with integrated text.
  • Branding and Identity — Design logos, signage, and branded visuals with readable text elements.
  • Editorial and Publishing — Produce illustrations, cover art, and visual content for articles.

Pro Tips

  • Be specific in your prompts — include subject, style, lighting, camera angle, and atmosphere for best results.
  • For text in images, explicitly specify the exact text, font style, and placement (e.g., "poster with the text SUMMER SALE in bold red letters at the top").
  • Use the same seed with the same prompt to reproduce identical outputs.
  • This model is specifically optimized for generating readable text within images.

Notes

  • Please ensure your prompts comply with content guidelines. If an error occurs, review your prompt and try again.

Related Models

การเข้าถึง:เว็บไซต์นี้ใช้โมเดล AI ที่จัดหาโดยบุคคลที่สาม

Qwen Image Text To Image 2512 API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-2512 with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Qwen Image Text To Image 2512 below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/text-to-image-2512" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/qwen-image/text-to-image-2512", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "seed": -1,
        "output_format": "jpeg",
        "enable_sync_mode": false,
        "enable_base64_output": false
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/qwen-image/text-to-image-2512",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1,
    "output_format": "jpeg",
    "enable_sync_mode": false,
    "enable_base64_output": false
}
)

print(output["outputs"][0])  # → URL of the generated output

Qwen Image Text To Image 2512 API — Frequently asked questions

What is the Qwen Image Text To Image 2512 API?

Qwen Image Text To Image 2512 is a WaveSpeedAI model for image generation, exposed as a REST API on WaveSpeedAI. Qwen Image 2512 is Qwen's latest text-to-image model with enhanced prompt understanding, superior text rendering, and versatile aspect ratio support. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Qwen Image Text To Image 2512 API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-text-to-image-2512.

How much does Qwen Image Text To Image 2512 cost per run?

Qwen Image Text To Image 2512 starts at $0.020 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Qwen Image Text To Image 2512 accept?

Key inputs: `prompt`, `size`, `seed`, `enable_base64_output`, `enable_sync_mode`, `output_format`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-text-to-image-2512.

How long does Qwen Image Text To Image 2512 take to generate?

Average end-to-end generation time on WaveSpeedAI is around 4 seconds per request — measured across recent runs. Queue time scales with global demand; live status is visible in the prediction record.

Can I use Qwen Image Text To Image 2512 outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.