Vidu Contest
WaveSpeed.ai
Inicio/Explorar/Qwen AI Models/wavespeed-ai/qwen-image/text-to-image-2512
text-to-image

text-to-image

Qwen Image 2512

wavespeed-ai/qwen-image/text-to-image-2512

Qwen Image 2512 is Alibaba Qwen's latest text-to-image model with enhanced prompt understanding, superior text rendering, and versatile aspect ratio support. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.

Input
width
height
1024 × 1024 px
Range: 256 - 1536
If set to true, the function will wait for the result to be generated and uploaded before returning the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

A 25-year-old woman with flowing auburn hair, captured in golden hour light streaming through venetian blinds, creating dramatic shadow patterns across her face. Shot on Hasselblad H6D-100c, 85mm f/1.4 lens, shallow depth of field, film grain, Kodak Portra 400 color science. Cinematic composition with negative space, melancholic atmosphere.

Tu solicitud costará $0.02 por ejecución.

Con $1 puedes ejecutar este modelo aproximadamente 50 veces.

Una cosa más:

EjemplosVer todo

A 25-year-old woman with flowing auburn hair, captured in golden hour light streaming through venetian blinds, creating dramatic shadow patterns across her face. Shot on Hasselblad H6D-100c, 85mm f/1.4 lens, shallow depth of field, film grain, Kodak Portra 400 color science. Cinematic composition with negative space, melancholic atmosphere.
Abandoned Art Deco cinema interior, dust particles floating in shafts of light from broken skylights, ornate ceiling details crumbling, velvet seats covered in decades of debris. Wide angle lens distortion, HDR dynamic range, mysterious atmosphere, urbex photography aesthetic.
A colossal ancient tree growing through a collapsed cathedral, roots wrapped around gothic pillars, bioluminescent fungi illuminating the darkness, tiny floating spirits drifting upward like embers. Painted in the style of Craig Mullins meets Hayao Miyazaki, rich atmospheric perspective, matte painting quality, 8K resolution, epic scale.
Venice canal scene in the style of John Singer Sargent, loose impressionistic brushwork capturing light dancing on water, gondolas in soft focus background, palazzo facades in warm terracotta and faded ochre. Oil painting texture, visible canvas weave, museum quality reproduction, gilt frame crop.
Lone motorcyclist stopped at abandoned gas station at dusk, removing helmet to reveal weathered face and grey hair, wanted poster with her younger face peeling off the wall behind her, dust storm approaching on the horizon. Coen Brothers Americana, Cormac McCarthy desolation, you can hear the silence, she's been running for decades.

README

Qwen Image 2512

Qwen Image 2512 is Alibaba's latest text-to-image generation model from the Qwen AI family. It excels at understanding natural language prompts and producing high-quality images with exceptional text rendering capabilities — perfect for creating posters, signage, logos, and designs requiring readable text.

Why Choose This?

  • Superior text rendering Accurately generates legible text within images, including multiple languages, fonts, and layouts. Ideal for designs requiring readable text elements.

  • Enhanced prompt understanding Interprets complex, detailed prompts with better comprehension of subject relationships, spatial arrangements, and stylistic nuances.

  • Flexible sizing Supports custom width and height configurations for various use cases — social media, presentations, print, and web content.

  • Consistent quality across styles Produces high-quality results whether you're creating photorealistic images, illustrations, concept art, or abstract designs.

  • Prompt Enhancer Built-in tool to automatically improve your prompts for better generation results.

Parameters

ParameterRequiredDescription
promptYesDescribe the image you want to create
widthNoImage width in pixels (default: 1024)
heightNoImage height in pixels (default: 1024)
seedNoRandom seed for reproducible results (-1 for random)
output_formatNoOutput format: jpeg, png, or webp

Output Format Options

  • jpeg — Smaller file size, good for photos and web use
  • png — Lossless quality, supports transparency, best for graphics with text
  • webp — Modern format with better compression, good browser support

How to Use

  1. Write your prompt — describe the image you want, including style, composition, lighting, and mood.
  2. Adjust size — set width and height for your desired dimensions.
  3. Set seed — use -1 for random results, or specify a number for reproducibility.
  4. Choose output format — select jpeg, png, or webp based on your needs.
  5. Run — click Run, preview the result, and iterate if needed.

Pricing

ItemCost
Per image$0.02

Simple flat-rate pricing regardless of image size.

Best Use Cases

  • Marketing and Advertising — Create eye-catching visuals with text for ads, posters, and promotional materials.
  • Social Media Content — Generate engaging images optimized for different platform formats.
  • Product Design — Visualize concepts, mockups, and packaging designs with integrated text.
  • Branding and Identity — Design logos, signage, and branded visuals with readable text elements.
  • Editorial and Publishing — Produce illustrations, cover art, and visual content for articles.

Pro Tips

  • Be specific in your prompts — include subject, style, lighting, camera angle, and atmosphere for best results.
  • For text in images, explicitly specify the exact text, font style, and placement (e.g., "poster with the text SUMMER SALE in bold red letters at the top").
  • Use the same seed with the same prompt to reproduce identical outputs.
  • This model is specifically optimized for generating readable text within images.

Notes

  • Please ensure your prompts comply with content guidelines. If an error occurs, review your prompt and try again.

Related Models