Seedance 2.0 20% 할인 | Video Generator에서 만들기 →

Qwen Image 2.0 Text to Image

wavespeed-ai /

Qwen Image 2.0 is an advanced text-to-image model with enhanced image quality and improved prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

text-to-image
입력
width
height
2048 × 2048 px
Range: 256 - 2048

대기 중

A Black woman with waist-length loc'd hair caught mid-spin in a dance, her locs fanning out radially following centrifugal motion, some locs wrapped with gold thread, some with cowrie shells, some with blue beads. Individual locs are clearly separated and countable. She is wearing a flowing white dress also caught in the spin motion. Droplets of water flying off the loc tips against a dark background, lit by a single overhead spotlight creating a halo effect

$0.03실행당·~33 / $1

다음:

예시전체 보기

A Black woman with waist-length loc'd hair caught mid-spin in a dance, her locs fanning out radially following centrifugal motion, some locs wrapped with gold thread, some with cowrie shells, some with blue beads. Individual locs are clearly separated and countable. She is wearing a flowing white dress also caught in the spin motion. Droplets of water flying off the loc tips against a dark background, lit by a single overhead spotlight creating a halo effect

A Black woman with waist-length loc'd hair caught mid-spin in a dance, her locs fanning out radially following centrifugal motion, some locs wrapped with gold thread, some with cowrie shells, some with blue beads. Individual locs are clearly separated and countable. She is wearing a flowing white dress also caught in the spin motion. Droplets of water flying off the loc tips against a dark background, lit by a single overhead spotlight creating a halo effect

A glass table with exactly 5 red apples arranged in a perfect pentagon pattern, reflected clearly on the glass surface below. Behind the table, a calico cat sits on the left and a golden retriever lies on the right. Through the window behind them, a crescent moon is visible in a twilight sky.

A glass table with exactly 5 red apples arranged in a perfect pentagon pattern, reflected clearly on the glass surface below. Behind the table, a calico cat sits on the left and a golden retriever lies on the right. Through the window behind them, a crescent moon is visible in a twilight sky.

An underwater living room with a burning fireplace, the flames flickering normally despite being submerged, a leather sofa floating slightly above the sandy ocean floor, tropical fish swimming between bookshelves filled with dry intact books, caustic light patterns dancing on the ceiling

An underwater living room with a burning fireplace, the flames flickering normally despite being submerged, a leather sofa floating slightly above the sandy ocean floor, tropical fish swimming between bookshelves filled with dry intact books, caustic light patterns dancing on the ceiling

A young woman with slender fingers delicately threading a needle, the thread visibly passing through the needle's eye, her left hand holding the needle steady between thumb and index finger while her right hand pinches the thread tip. She wears a different ring on each finger — silver, gold, jade, ruby, and pearl. Soft window light from the left, shallow depth of field, shot on 85mm f/1.4

A young woman with slender fingers delicately threading a needle, the thread visibly passing through the needle's eye, her left hand holding the needle steady between thumb and index finger while her right hand pinches the thread tip. She wears a different ring on each finger — silver, gold, jade, ruby, and pearl. Soft window light from the left, shallow depth of field, shot on 85mm f/1.4

An infographic explaining responsive web design breakpoints. Side-by-side comparison showing the same website layout adapting across four devices: mobile (375px), tablet (768px), laptop (1024px), and desktop (1440px). Clean vector style with labeled arrows showing how grid columns, navigation, and content blocks reflow at each breakpoint. Dashed guide lines indicating margin, padding, and column widths. White background, purple and light gray color scheme. Modern flat UI design with device mockup frames and pixel dimension annotations. Show in all English words

An infographic explaining responsive web design breakpoints. Side-by-side comparison showing the same website layout adapting across four devices: mobile (375px), tablet (768px), laptop (1024px), and desktop (1440px). Clean vector style with labeled arrows showing how grid columns, navigation, and content blocks reflow at each breakpoint. Dashed guide lines indicating margin, padding, and column widths. White background, purple and light gray color scheme. Modern flat UI design with device mockup frames and pixel dimension annotations. Show in all English words

관련 모델

README

Qwen Image 2.0 Text-to-Image

Qwen Image 2.0 is advanced text-to-image model that generates high-quality images from detailed text descriptions. With exceptional prompt following, flexible aspect ratios, and custom resolution support, it excels at rendering complex scenes with fine details like hair, accessories, and textures.

Why Choose This?

  • Strong prompt adherence Excels at following detailed, complex prompts with multiple elements and attributes.

  • Fine detail rendering Excellent at rendering intricate details like hair textures, jewelry, and clothing accessories.

  • Flexible aspect ratios Multiple presets including 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, and 2:3.

  • Custom resolution Adjustable width and height from 256 to 2048 pixels.

  • Prompt Enhancer Built-in tool to automatically improve your descriptions.

Parameters

ParameterRequiredDescription
promptYesText description of the desired image
sizeNoAspect ratio preset: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3
widthNoCustom width in pixels (range: 256–2048)
heightNoCustom height in pixels (range: 256–2048)
seedNoRandom seed for reproducibility (-1 for random)

How to Use

  1. Write your prompt — describe the image in detail, including specific attributes, styles, and elements.
  2. Choose size — select a preset aspect ratio or customize width/height.
  3. Use Prompt Enhancer (optional) — click to automatically refine your description.
  4. Set seed (optional) — for reproducible results.
  5. Run — submit and download your generated image.

Pricing

OutputCost
Per image$0.03

Best Use Cases

  • Detailed Character Art — Generate characters with specific attributes like hair styles, clothing, and accessories.
  • Portrait Photography — Create photorealistic portraits with fine details.
  • Fashion & Style — Visualize outfits, hairstyles, and jewelry with precision.
  • Concept Art — Render complex scenes with multiple elements.
  • Cultural & Artistic — Generate images with specific cultural elements and decorations.

Pro Tips

  • Use highly detailed prompts — the model excels at following complex descriptions with multiple attributes.
  • Describe specific details like "waist-length loc'd hair," "gold thread," "cowrie shells," or "blue beads" for precise rendering.
  • Include motion and pose descriptions for dynamic images (e.g., "caught mid-spin in a dance").
  • Match aspect ratio to your content: 1:1 for portraits, 16:9 for landscapes, 9:16 for full-body shots.
  • Use the same seed to reproduce or iterate on specific results.

Notes

  • Prompt is the only required field.
  • Resolution range: 256–2048 pixels for both width and height.
  • Default size is 1:1.
  • Ensure your prompts comply with content guidelines.

Related Models

접근성:이 웹사이트는 제3자가 제공하는 AI 모델을 사용합니다.

Qwen Image 2.0 Text To Image API — Quick start

Grab a WaveSpeedAI API key, then call POST https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image-2.0/text-to-image with your input as JSON. The endpoint returns a prediction id; poll the prediction endpoint until status flips to completed, then read the output URL from data.outputs[0]. Examples for Qwen Image 2.0 Text To Image below.

HTTP example
# Submit the prediction
curl -X POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image-2.0/text-to-image" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY" \
  -d '{
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1
}'

# Response includes a prediction id. Poll for the result:
curl -X GET "https://api.wavespeed.ai/api/v3/predictions/{request_id}/result" \
  -H "Authorization: Bearer $WAVESPEED_API_KEY"

# When status is "completed", read the output from data.outputs[0].
Node.js example
// npm install wavespeed
const WaveSpeed = require('wavespeed');

const client = new WaveSpeed(); // reads WAVESPEED_API_KEY from env

const result = await client.run("wavespeed-ai/qwen-image-2.0/text-to-image", {
        "prompt": "A cinematic shot of a city at sunset, soft golden light",
        "size": "1024*1024",
        "seed": -1
});

console.log(result.outputs[0]); // → URL of the generated output
Python example
# pip install wavespeed
import wavespeed

output = wavespeed.run(
    "wavespeed-ai/qwen-image-2.0/text-to-image",
    {
    "prompt": "A cinematic shot of a city at sunset, soft golden light",
    "size": "1024*1024",
    "seed": -1
}
)

print(output["outputs"][0])  # → URL of the generated output

Qwen Image 2.0 Text To Image API — Frequently asked questions

What is the Qwen Image 2.0 Text To Image API?

Qwen Image 2.0 Text To Image is a WaveSpeedAI model for image generation, exposed as a REST API on WaveSpeedAI. Qwen Image 2.0 is an advanced text-to-image model with enhanced image quality and improved prompt understanding. Up to 2k. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing. You can call it programmatically or try it from the playground above.

How do I call the Qwen Image 2.0 Text To Image API?

POST your input parameters to the model's REST endpoint (shown in the API tab of this playground) with your WaveSpeedAI API key in the Authorization header. Submission returns a prediction ID; poll the prediction endpoint until status flips to "completed", then read the output URL from the result. The playground generates a ready-to-paste code sample in Python, JavaScript, or cURL for whatever inputs you've set. Full request/response shape is documented at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-2.0-text-to-image.

How much does Qwen Image 2.0 Text To Image cost per run?

Qwen Image 2.0 Text To Image starts at $0.030 per run. That figure is the base price — the final charge scales with the parameters you set in the form (output size, length, count, references, or whatever knobs this model exposes), so a higher-quality or larger output costs more than a minimal one. The exact cost for your current input is shown live next to the Generate button before you submit, and the actual per-call charge is recorded on the prediction afterwards.

What inputs does Qwen Image 2.0 Text To Image accept?

Key inputs: `prompt`, `size`, `seed`. The full JSON schema (types, defaults, allowed values) is rendered above the Generate button and mirrored in the API reference at https://wavespeed.ai/docs/docs-api/wavespeed-ai/qwen-image-2.0-text-to-image.

How do I get started with the Qwen Image 2.0 Text To Image API?

Sign up for a free WaveSpeedAI account to claim starter credits, copy your API key from /accesskey, then call the endpoint shown in the API tab of the playground. The playground also auto-generates a code sample in Python, JavaScript, or cURL for the parameters you've set.

Can I use Qwen Image 2.0 Text To Image outputs commercially?

Commercial usage rights depend on the model's license, set by its provider (WaveSpeedAI). The license summary appears on the model card above; see WaveSpeedAI's Terms of Service for platform-level conditions.