Home/Explore/wavespeed-ai/qwen-image/text-to-image

text-to-image

logo

wavespeed-ai/qwen-image/text-to-image

Qwen-Image — a 20B MMDiT model for next-gen text-to-image generation.

Doc
width
height
If set to true, the function will wait for the image to be generated and uploaded before returning the response. It allows you to get the image directly in the response. This property is only available through the API.
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.

Idle

A girl with little freckles and messy red hair sitting on a rooftop during sunset, denim jacket slightly worn, holding a Polaroid camera, city skyline glowing in soft hues behind her

Your request will cost $0.02 per run.

For $1 you can run this model approximately 50 times.

ExamplesView all

An elderly baker dusting flour onto a wooden table in a sunlit kitchen, deep wrinkles on his hands, flour particles suspended in the air, warm rustic tones and natural shadows
A child looking out from the backseat of an old car during a rainy afternoon, raindrops trickling down the window, reflections of city lights blending with the soft glow of the dashboard
A barista in a cozy café pouring latte art into a ceramic cup, soft morning sunlight streaming through the window blinds, coffee steam rising in the air, bookshelves in the background
A fashion-forward woman walking across a cobblestone street in Paris, wearing a camel trench coat and high heels, wind catching her scarf, soft golden hour light hitting her cheekbones
A man sketching in a small studio filled with canvases and charcoal dust, his hands stained with black pigment, a shaft of sunlight highlighting the textured surface of his work
A girl with little freckles and messy red hair sitting on a rooftop during sunset, denim jacket slightly worn, holding a Polaroid camera, city skyline glowing in soft hues behind her
A young couple sitting on a picnic blanket in an overgrown field, dappled sunlight through tall grass, soft focus background, worn jeans, vintage thermos, intimacy and warmth in their posture
A freckled teenage girl with untamed auburn curls pulled into a loose bun, wearing a corduroy jacket with frayed sleeves and vintage enamel pins, leaning against a graffiti-covered brick wall in Brooklyn at dusk, chewing a red lollipop, scattered trash and old flyers around her feet, amber streetlight casting a long shadow behind her, expression caught between boredom and defiance

README

Qwen-Image — a 20B MMDiT model for next-gen text-to-image generation. Especially strong at creating stunning graphic posters with native text. Now open-source.

Key Highlights:

  • SOTA text rendering — rivals GPT-4o in English, best-in-class for Chinese
  • In-pixel text generation — no overlays, fully integrated
  • Bilingual support, diverse fonts, complex layouts

Also excels at general image generation — from photorealistic to anime, impressionist to minimalist. A true creative powerhouse.