Nano Banana 2Nano Banana 2 is live
WaveSpeed.ai
Startseite/Entdecken/Qwen Image 2 Models/wavespeed-ai/qwen-image-2.0-pro/text-to-image
text-to-image

text-to-image

Qwen Image 2.0 Pro

wavespeed-ai/qwen-image-2.0-pro/text-to-image

Qwen Image 2.0 Pro is a professional-grade text-to-image model with superior quality and advanced prompt understanding. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Input
width
height
1024 × 1024 px
Range: 256 - 1536

Idle

A rectangular dinner table shot from above at 45 degrees. Seated around it are 8 people of different ethnicities, ages, and body types. The elderly American grandmother at the head is mid-laugh with her eyes squeezed shut. The toddler in a high chair to her left has spaghetti smeared across both cheeks and is reaching with both hands toward a glass of water. A teenage girl across the table is secretly showing her phone screen to the boy next to her under the table — their hands and the phone visible beneath the tablecloth. A bearded man is pouring wine, the liquid caught mid-pour in a perfect arc. Each person casts correct shadows from the overhead pendant lamp. The table has 8 distinct place settings with different amounts of food remaining on each plate.

Ihre Anfrage kostet $0.07 pro Durchlauf.

Für $1 können Sie dieses Modell ungefähr 14 Mal ausführen.

Noch etwas:

BeispieleAlle anzeigen

A rectangular dinner table shot from above at 45 degrees. Seated around it are 8 people of different ethnicities, ages, and body types. The elderly American grandmother at the head is mid-laugh with her eyes squeezed shut. The toddler in a high chair to her left has spaghetti smeared across both cheeks and is reaching with both hands toward a glass of water. A teenage girl across the table is secretly showing her phone screen to the boy next to her under the table — their hands and the phone visible beneath the tablecloth. A bearded man is pouring wine, the liquid caught mid-pour in a perfect arc. Each person casts correct shadows from the overhead pendant lamp. The table has 8 distinct place settings with different amounts of food remaining on each plate.
A weathered bronze plaque mounted on a mossy stone wall that reads "FOUNDED 1847 — THE BROTHERHOOD OF ETERNAL WANDERERS" in deeply engraved serif lettering, with raindrops trickling down the letters, some letters partially obscured by creeping ivy
Extreme macro close-up of a single dewdrop on a spider web strand, inside the dewdrop is a perfectly refracted upside-down reflection of a vast mountain landscape with snow-capped peaks and a winding river valley, the spider silk shows individual fiber details, background is a soft bokeh sunrise
A detailed anatomical infographic of the human heart and blood circulation system. Cross-section view showing four chambers, valves, aorta, and pulmonary arteries. Clean medical illustration style with labeled arrows indicating oxygenated blood flow in red and deoxygenated blood flow in blue. White background, red and blue dual-tone color scheme. Vector flat design with clear annotations, directional arrows, and simplified yet anatomically accurate proportions.
A photorealistic interior design rendering of a Scandinavian minimalist living room. Open-plan layout with double-height ceiling and floor-to-ceiling windows allowing abundant natural light. Low-profile light oak sofa with off-white linen cushions, round marble coffee table, and a single statement Arco floor lamp. Built-in wall shelving with curated ceramics and potted monstera. Pale birch hardwood flooring with a hand-woven ivory wool rug. Neutral palette of warm whites, soft grays, and natural wood tones. Shot from a 3/4 perspective at eye level, architectural photography style with soft diffused lighting, 35mm wide-angle lens, shallow depth of field focusing on the seating area.

README

Qwen Image 2.0 Pro Text-to-Image

Qwen Image 2.0 Pro is Alibaba's premium text-to-image model, delivering the highest quality output in the Qwen Image 2.0 family. With superior detail rendering, enhanced prompt adherence, and professional-grade visual fidelity, it's ideal for production work requiring maximum quality.

Why Choose This?

  • Pro-tier quality Maximum visual fidelity and detail in the Qwen Image 2.0 family.

  • Superior prompt adherence Best-in-class at following detailed, complex prompts with multiple elements and attributes.

  • Enhanced detail rendering Exceptional at rendering intricate details like hair textures, jewelry, skin tones, and fabric.

  • Flexible aspect ratios Multiple presets including 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, and 2:3.

  • Custom resolution Adjustable width and height from 256 to 1536 pixels.

  • Prompt Enhancer Built-in tool to automatically improve your descriptions.

Parameters

ParameterRequiredDescription
promptYesText description of the desired image
sizeNoAspect ratio preset: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3
widthNoCustom width in pixels (range: 256–1536)
heightNoCustom height in pixels (range: 256–1536)
seedNoRandom seed for reproducibility (-1 for random)

How to Use

  1. Write your prompt — describe the image in detail, including specific attributes, styles, and elements.
  2. Choose size — select a preset aspect ratio or customize width/height.
  3. Use Prompt Enhancer (optional) — click to automatically refine your description.
  4. Set seed (optional) — for reproducible results.
  5. Run — submit and download your generated image.

Pricing

OutputCost
Per image$0.07

Best Use Cases

  • Professional Production — High-end visuals requiring maximum quality.
  • Detailed Character Art — Generate characters with specific attributes and fine details.
  • Portrait Photography — Create photorealistic portraits with exceptional detail.
  • Fashion & Beauty — Visualize outfits, hairstyles, makeup, and jewelry with precision.
  • Commercial & Advertising — Premium imagery for marketing and brand campaigns.

Pro Tips

  • Use highly detailed prompts — the Pro model excels at following complex descriptions with multiple attributes.
  • Describe specific details like "waist-length loc'd hair," "gold thread," "cowrie shells," or "blue beads" for precise rendering.
  • Include motion and pose descriptions for dynamic images (e.g., "caught mid-spin in a dance").
  • Pro tier is recommended for final production work where quality is paramount.
  • Use the standard Qwen Image 2.0 for iterations, then switch to Pro for final renders.

Notes

  • Prompt is the only required field.
  • Resolution range: 256–1536 pixels for both width and height.
  • Default size is 1024×1024 (1:1).
  • Ensure your prompts comply with content guidelines.

Related Models