image-to-image

wavespeed-ai/qwen-image/edit-plus

Qwen-Image-Edit-Plus is a 20B MMDiT image editing model with improved multi-image editing, better single-image consistency, and native ControlNet support.

If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API.
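
A minimal sketch of handling the BASE64 output option described above. It assumes the response carries the image as a plain Base64 string, optionally with a `data:` URI prefix; that prefix and the function name `decode_image_b64` are assumptions for illustration and are not stated on this page.

```python
import base64


def decode_image_b64(payload: str) -> bytes:
    """Decode a Base64-encoded image string into raw bytes.

    Assumption: the string may carry a "data:<mime>;base64," prefix,
    which is stripped before decoding.
    """
    if payload.startswith("data:") and "," in payload:
        payload = payload.split(",", 1)[1]
    # validate=True rejects characters outside the Base64 alphabet
    return base64.b64decode(payload, validate=True)
```

The returned bytes can then be written to disk with `open(path, "wb")`, using whichever extension matches the requested output format.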


Your request will cost $0.02 per run.

For $1 you can run this model approximately 50 times.
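
The arithmetic behind that estimate can be sketched as follows. The $0.02 price comes from this page; the helper names are hypothetical, and `Decimal` is used only to avoid floating-point rounding on currency values.

```python
from decimal import Decimal

PRICE_PER_RUN = Decimal("0.02")  # per-run price quoted on this page


def runs_for_budget(budget_usd: str) -> int:
    """Number of whole runs a budget covers at the quoted price."""
    return int(Decimal(budget_usd) // PRICE_PER_RUN)


def cost_for_runs(runs: int) -> Decimal:
    """Total cost in USD for a given number of runs."""
    return PRICE_PER_RUN * runs
```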

Examples

The bikini girl in picture 1 poses according to the posture in picture 2.
Color the Spider-Man and change his pose to the fighting stance.
The woman in image 2 adopts the pose from image 1.
The girl is wearing the necklace in Figure 2.
The girl in Figure 2 and the boy in Figure 3 are embracing on the sofa in Figure 1.
Replace the portrait of the person in the second image with the background in the first image.
The girl is wearing a necklace.
Change the background to the seaside
Let them kiss
Get her to put on a bikini and strike a sexy pose

README

Qwen-Image-Edit-Plus (20B, MMDiT)

A next-gen image editing model built on Qwen-Image 20B. It delivers precise bilingual (Chinese & English) text editing, supports both appearance-level and semantic-level edits, and preserves the original style.

Why choose this?

  • Dual-mode editing

    • Appearance editing: add/remove/modify elements while keeping all other regions pixel-accurate and unchanged.
    • Semantic editing: higher-level changes (IP creation, pose/rotation, style transfer) that allow global pixel updates while keeping semantic intent.
  • Precise text editing (CN/EN): edit on-image text directly (add/delete/replace) while retaining the original font, size, kerning, and style.

  • Style preservation: maintains palette, lighting, brushwork, and overall look even under substantial edits.

  • Strong benchmark results: evaluated across multiple public editing benchmarks, with state-of-the-art performance reported.

Designed for

  • Design & Marketing teams – Rapid visual iterations, brand-safe edits, and multilingual comps.
  • E-commerce & Social – Clean product touch-ups, quick hero swaps, localized text.
  • Creators & Studios – Concepting, IP style moves, pose/angle changes without repainting.

Example prompts

  • Appearance (CN): 在桌面右上角添加一杯拿铁,不改变其他区域。 (Add a latte at the top-right corner of the desk without changing any other region.)
  • Semantic (EN): Turn the product into a cyberpunk style while keeping the brand logo and layout consistent.
  • Text edit (EN): Replace the headline "Summer Sale" with "Autumn Sale" and keep the same font and size.

Pricing

$0.02 per image.

How to use

  1. Upload the source image.
  2. Write the prompt (Chinese or English).
  3. Generate. Results arrive in moments.
  4. Output formats: JPG, PNG, or WEBP.
  5. Review and iterate: keep the same seed for exact reproduction, or change it for A/B comparisons.
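
For API use, the steps above can be sketched as building a request body. This page does not document the endpoint or field names, so everything here is an assumption for illustration: the endpoint URL is a placeholder, and the keys `prompt`, `images`, `seed`, and `output_format` are hypothetical. Only the three output formats and the role of the seed come from the text above.

```python
import json

# Assumption: placeholder endpoint, not the service's documented URL.
HYPOTHETICAL_ENDPOINT = "https://api.example.com/wavespeed-ai/qwen-image/edit-plus"

ALLOWED_FORMATS = {"jpg", "png", "webp"}  # formats listed on this page


def build_edit_request(prompt, image_urls, seed=None, output_format="jpg"):
    """Assemble a JSON-serializable request body (all field names hypothetical)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not image_urls:
        raise ValueError("at least one source image is required")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported output format: {output_format}")
    body = {
        "prompt": prompt,
        "images": list(image_urls),
        "output_format": output_format,
    }
    if seed is not None:
        # Reusing a seed is what the page recommends for reproducing a result.
        body["seed"] = int(seed)
    return body


if __name__ == "__main__":
    request_body = build_edit_request(
        "Change the background to the seaside",
        ["https://example.com/source.jpg"],
        seed=42,
        output_format="png",
    )
    print(json.dumps(request_body, indent=2))
```

Keeping the seed in the body only when one is supplied lets the same helper cover both cases in step 5: a fixed seed for reproduction, or no seed for fresh variations.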

Note

If you supply the image by URL instead of uploading a local file, make sure the URL is publicly accessible. An accessible image displays a preview in the interface.
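
One way to check accessibility before submitting is a quick HEAD request. This is a sketch using only Python's standard library; the function name is hypothetical, the `opener` parameter exists only to make the check testable, and some servers reject HEAD requests, so a `False` result is a hint rather than proof that the service cannot fetch the image.

```python
import urllib.request


def is_image_url_accessible(url, timeout=10.0, opener=urllib.request.urlopen):
    """Return True if a HEAD request succeeds and reports an image content type."""
    if not url.lower().startswith(("http://", "https://")):
        return False
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=timeout) as response:
            content_type = response.headers.get("Content-Type", "")
            return response.status == 200 and content_type.startswith("image/")
    except (OSError, ValueError):
        # URLError, HTTPError, and socket timeouts are all OSError subclasses.
        return False
```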

Recommended Resolutions

| Aspect Ratio | Exact (W×H) | Exact Pixels | Rounded (W×H, ÷64) | Rounded Pixels |
|---|---|---|---|---|
| 1:1 | 1448 × 1448 | 2,096,704 | 1408 × 1408 | 1,982,464 |
| 3:2 | 1773 × 1182 | 2,095,686 | 1728 × 1152 | 1,990,656 |
| 4:3 | 1672 × 1254 | 2,096,688 | 1664 × 1216 | 2,023,424 |
| 16:9 | 1936 × 1089 | 2,108,304 | 1920 × 1088 | 2,088,960 |
| 21:9 | 2212 × 948 | 2,096,976 | 2176 × 960 | 2,088,960 |
| 1:1 | 1024 × 1024 | 1,048,576 | 1024 × 1024 | 1,048,576 |
| 3:2 | 1254 × 836 | 1,048,344 | 1216 × 832 | 1,011,712 |
| 4:3 | 1182 × 887 | 1,048,434 | 1152 × 896 | 1,032,192 |
| 16:9 | 1365 × 768 | 1,048,320 | 1344 × 768 | 1,032,192 |
| 21:9 | 1564 × 670 | 1,047,880 | 1536 × 640 | 983,040 |
| 1:1 | 323 × 323 | 104,329 | 320 × 320 | 102,400 |
| 3:2 | 397 × 264 | 104,808 | 384 × 256 | 98,304 |
| 4:3 | 374 × 280 | 104,720 | 448 × 320 | 143,360 |
| 16:9 | 432 × 243 | 104,976 | 448 × 256 | 114,688 |
| 21:9 | 495 × 212 | 104,940 | 576 × 256 | 147,456 |
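
The "÷64" column lists sizes whose width and height are multiples of 64. A minimal sketch of one way to derive such a size is to round each dimension down to the nearest multiple of 64. This is an assumption about the method, not the page's stated rule: it reproduces rows such as 1448 × 1448 and 1773 × 1182, but some listed rows round up instead (for example, 948 is listed as 960), so treat the table as authoritative.

```python
def snap_down(value: int, multiple: int = 64) -> int:
    """Round down to the nearest multiple, never going below one multiple."""
    return max(multiple, (value // multiple) * multiple)


def snap_size(width: int, height: int, multiple: int = 64):
    """Return (width, height, pixel_count) with both sides snapped down."""
    w = snap_down(width, multiple)
    h = snap_down(height, multiple)
    return w, h, w * h
```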