text-to-image

Qwen Image 2512

wavespeed-ai/qwen-image/text-to-image-2512

Qwen Image 2512 is Alibaba Qwen's latest text-to-image model, with enhanced prompt understanding, superior text rendering, and versatile aspect ratio support. It is available through a ready-to-use REST inference API with no cold starts and affordable pricing.

Your request will cost $0.025 per run.

For $1 you can run this model approximately 40 times.

Examples

A 25-year-old woman with flowing auburn hair, captured in golden hour light streaming through venetian blinds, creating dramatic shadow patterns across her face. Shot on Hasselblad H6D-100c, 85mm f/1.4 lens, shallow depth of field, film grain, Kodak Portra 400 color science. Cinematic composition with negative space, melancholic atmosphere.
Abandoned Art Deco cinema interior, dust particles floating in shafts of light from broken skylights, ornate ceiling details crumbling, velvet seats covered in decades of debris. Wide angle lens distortion, HDR dynamic range, mysterious atmosphere, urbex photography aesthetic.
A colossal ancient tree growing through a collapsed cathedral, roots wrapped around gothic pillars, bioluminescent fungi illuminating the darkness, tiny floating spirits drifting upward like embers. Painted in the style of Craig Mullins meets Hayao Miyazaki, rich atmospheric perspective, matte painting quality, 8K resolution, epic scale.
Venice canal scene in the style of John Singer Sargent, loose impressionistic brushwork capturing light dancing on water, gondolas in soft focus background, palazzo facades in warm terracotta and faded ochre. Oil painting texture, visible canvas weave, museum quality reproduction, gilt frame crop.
Lone motorcyclist stopped at abandoned gas station at dusk, removing helmet to reveal weathered face and grey hair, wanted poster with her younger face peeling off the wall behind her, dust storm approaching on the horizon. Coen Brothers Americana, Cormac McCarthy desolation, you can hear the silence, she's been running for decades.

README

Qwen Image 2512

Qwen Image 2512 is Alibaba's latest text-to-image generation model from the Qwen AI family. It excels at understanding natural language prompts and producing high-quality images with exceptional text rendering capabilities — perfect for creating posters, signage, logos, and designs requiring readable text.

Why Choose This?

  • Superior text rendering: Accurately generates legible text within images, including multiple languages, fonts, and layouts. Ideal for designs requiring readable text elements.

  • Enhanced prompt understanding: Interprets complex, detailed prompts with better comprehension of subject relationships, spatial arrangements, and stylistic nuances.

  • Flexible sizing: Supports custom width and height configurations for various use cases, such as social media, presentations, print, and web content.

  • Consistent quality across styles: Produces high-quality results whether you're creating photorealistic images, illustrations, concept art, or abstract designs.

  • Prompt Enhancer: Built-in tool to automatically improve your prompts for better generation results.

Parameters

Parameter      Required  Description
prompt         Yes       Describe the image you want to create
width          No        Image width in pixels (default: 1024)
height         No        Image height in pixels (default: 1024)
seed           No        Random seed for reproducible results (-1 for random)
output_format  No        Output format: jpeg, png, or webp
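
As a rough illustration of how these parameters fit together, the sketch below assembles a request body in Python. The function name `build_payload` and the validation rules are assumptions made for this example. The page does not document the exact wire format, the valid size ranges, or a default output format, so `output_format` is simply left out when it is not given.

```python
from typing import Optional

# Formats listed in the parameter table above.
ALLOWED_FORMATS = {"jpeg", "png", "webp"}


def build_payload(
    prompt: str,
    width: int = 1024,    # default from the parameter table
    height: int = 1024,   # default from the parameter table
    seed: int = -1,       # -1 requests a random seed
    output_format: Optional[str] = None,
) -> dict:
    """Assemble a request body (hypothetical helper, not an official client)."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    # ASSUMPTION: the page gives no size limits, so only positivity is checked.
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive pixel counts")

    payload = {
        "prompt": prompt.strip(),
        "width": width,
        "height": height,
        "seed": seed,
    }
    if output_format is not None:
        if output_format not in ALLOWED_FORMATS:
            raise ValueError(f"output_format must be one of {sorted(ALLOWED_FORMATS)}")
        payload["output_format"] = output_format
    return payload
```

Calling `build_payload("a red fox", output_format="png")` would yield a dictionary with the default 1024 by 1024 size and a seed of -1.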

Output Format Options

  • jpeg — Smaller file size, good for photos and web use
  • png — Lossless quality, supports transparency, best for graphics with text
  • webp — Modern format with better compression, good browser support
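
When results are saved to disk, one convenient approach is to derive `output_format` from the target filename. The helper below is a minimal sketch of that idea; `format_for_file` and the extension mapping are illustrative names for this example and are not part of the service.

```python
from pathlib import Path

# Map common file extensions to the three format values listed above.
EXTENSION_TO_FORMAT = {
    ".jpg": "jpeg",
    ".jpeg": "jpeg",
    ".png": "png",
    ".webp": "webp",
}


def format_for_file(filename: str) -> str:
    """Return the output_format value matching a filename's extension."""
    suffix = Path(filename).suffix.lower()
    try:
        return EXTENSION_TO_FORMAT[suffix]
    except KeyError:
        raise ValueError(f"unsupported extension: {suffix!r}") from None


print(format_for_file("poster.PNG"))
print(format_for_file("photo.jpg"))
```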

How to Use

  1. Write your prompt — describe the image you want, including style, composition, lighting, and mood.
  2. Adjust size — set width and height for your desired dimensions.
  3. Set seed — use -1 for random results, or specify a number for reproducibility.
  4. Choose output format — select jpeg, png, or webp based on your needs.
  5. Run — click Run, preview the result, and iterate if needed.
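
The same steps could also be scripted against the REST API. The sketch below only prepares the HTTP request, using Python's standard library. The endpoint URL is a placeholder, and the bearer-token header and JSON field names are assumptions, since this page does not document them. Check the official API reference before sending anything.

```python
import json
import urllib.request

# ASSUMPTION: placeholder URL, not taken from this page.
API_URL = "https://api.example.com/wavespeed-ai/qwen-image/text-to-image-2512"


def prepare_request(
    api_key: str,
    prompt: str,
    width: int = 1024,
    height: int = 1024,
    seed: int = -1,
    output_format: str = "png",
) -> urllib.request.Request:
    """Build (but do not send) a POST request mirroring steps 1 to 4."""
    body = {
        "prompt": prompt,                # step 1: write your prompt
        "width": width,                  # step 2: adjust size
        "height": height,
        "seed": seed,                    # step 3: -1 for random
        "output_format": output_format,  # step 4: jpeg, png, or webp
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # ASSUMPTION: bearer-token authentication.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


# Step 5 (run) would be: urllib.request.urlopen(prepare_request(key, prompt))
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any credits are spent.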

Pricing

Item       Cost
Per image  $0.025

Simple flat-rate pricing regardless of image size.
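
Because the rate is flat, budgeting is simple arithmetic. The sketch below uses `Decimal` to avoid floating-point rounding; the helper names are illustrative only.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.025")  # flat rate from the pricing table


def cost_for(images: int) -> Decimal:
    """Total cost in dollars for a number of images."""
    return PRICE_PER_IMAGE * images


def images_for(budget: Decimal) -> int:
    """Whole number of images a dollar budget covers."""
    return int(budget // PRICE_PER_IMAGE)


print(cost_for(200))             # 5.000
print(images_for(Decimal("1")))  # 40
```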

Best Use Cases

  • Marketing and Advertising — Create eye-catching visuals with text for ads, posters, and promotional materials.
  • Social Media Content — Generate engaging images optimized for different platform formats.
  • Product Design — Visualize concepts, mockups, and packaging designs with integrated text.
  • Branding and Identity — Design logos, signage, and branded visuals with readable text elements.
  • Editorial and Publishing — Produce illustrations, cover art, and visual content for articles.

Pro Tips

  • Be specific in your prompts — include subject, style, lighting, camera angle, and atmosphere for best results.
  • For text in images, explicitly specify the exact text, font style, and placement (e.g., "poster with the text SUMMER SALE in bold red letters at the top").
  • Use the same seed with the same prompt to reproduce identical outputs.
  • This model is specifically optimized for generating readable text within images.
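
The first two tips can be applied mechanically by building prompts from named parts. The following is a small sketch of that idea. `compose_prompt` is a hypothetical helper, and its phrasing template is just one reasonable choice, not a format the model requires.

```python
from typing import Optional


def compose_prompt(
    subject: str,
    *,
    style: Optional[str] = None,
    lighting: Optional[str] = None,
    camera_angle: Optional[str] = None,
    atmosphere: Optional[str] = None,
    text: Optional[str] = None,
    text_style: Optional[str] = None,
    text_placement: Optional[str] = None,
) -> str:
    """Join the prompt ingredients named in the tips into one string."""
    head = subject
    if text:
        # Spell out the exact text, its style, and where it should appear.
        head += f' with the text "{text}"'
        if text_style:
            head += f" in {text_style}"
        if text_placement:
            head += f" {text_placement}"
    extras = [p for p in (style, lighting, camera_angle, atmosphere) if p]
    return ", ".join([head, *extras])


print(compose_prompt(
    "poster",
    text="SUMMER SALE",
    text_style="bold red letters",
    text_placement="at the top",
    style="flat vector illustration",
))
```

Reusing the resulting string with a fixed seed is then a convenient way to apply the reproducibility tip.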

Notes

  • Please ensure your prompts comply with content guidelines. If an error occurs, review your prompt and try again.
