image-to-image

Qwen Image Edit

wavespeed-ai/qwen-image/edit

Qwen-Image-Edit is a 20B MMDiT image-to-image model that makes precise bilingual (Chinese and English) text edits while preserving the original style. It is available through a ready-to-use REST inference API with high performance, no cold starts, and affordable pricing.


Your request will cost $0.02 per run.

For $1 you can run this model approximately 50 times.

Examples

Change the background to daytime.
Remove freckles from a woman's face.
Turn into comic style.
The girl walked in the rainy night, holding an umbrella.
Turn into Ghibli style.
The girl holds a bouquet of roses.
Remove the boy's hat.
Turn into pixel style.

README

Qwen-Image-Edit

A next-gen image editing model built on Qwen-Image 20B MMDiT. Qwen-Image-Edit delivers precise bilingual text editing (Chinese & English), supports both semantic and appearance-level edits, and preserves the original style — making it one of the most capable image editing models available.

Why It Looks Great

  • Dual-mode editing: Supports both appearance-level edits (add/remove/modify elements while keeping other regions pixel-accurate) and semantic-level edits (IP creation, rotation, style transfer with global coherence).
  • Precise text editing (CN/EN): Edit on-image text directly — add, delete, or replace while retaining the original font, size, kerning, and style.
  • Style preservation: Maintains palette, lighting, brushwork, and overall look even under substantial edits.
  • 20B parameter power: Built on the robust Qwen-Image foundation for superior understanding and generation.
  • State-of-the-art performance: Achieves SOTA results across multiple public image editing benchmarks.
  • Prompt Enhancer: Built-in tool to refine your editing instructions automatically.

Parameters

| Parameter | Required | Description |
| --- | --- | --- |
| prompt | Yes | Text instruction describing the edit you want to make. |
| image | Yes | Source image to edit (upload or public URL). |
| size | No | Custom output dimensions with width and height controls. |
| width | No | Output width in pixels (e.g., 256). |
| height | No | Output height in pixels (e.g., 256). |
| seed | No | Random seed for reproducibility. Use -1 for random. |
| output_format | No | Output file format: jpeg or png. Default: jpeg. |
| enable_base64_output | No | API only: Returns base64 string instead of URL. |
| enable_sync_mode | No | API only: Waits for result and returns it directly in response. |
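
As a rough illustration of how the parameters above might be assembled for the REST API, the sketch below builds a JSON body and an unsent POST request. The base URL, the URL layout, and the Bearer authorization header are assumptions made for illustration and are not taken from this page; only the parameter names and the model identifier come from it. The size, width, and height fields are left out because their wire format is not specified here.

```python
import json
import urllib.request

# Hypothetical base URL: replace with the value from the provider's API docs.
API_BASE = "https://api.example.invalid"
MODEL_PATH = "wavespeed-ai/qwen-image/edit"  # model identifier from this page

ALLOWED_FORMATS = {"jpeg", "png"}


def build_edit_payload(prompt, image, seed=-1, output_format="jpeg",
                       enable_base64_output=False, enable_sync_mode=False):
    """Assemble a request body from the documented parameters."""
    if not prompt or not image:
        raise ValueError("prompt and image are both required")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError("output_format must be 'jpeg' or 'png'")
    return {
        "prompt": prompt,
        "image": image,  # public URL of the source image
        "seed": seed,    # -1 asks for a random seed
        "output_format": output_format,
        "enable_base64_output": enable_base64_output,
        "enable_sync_mode": enable_sync_mode,
    }


def build_request(payload, api_key):
    """Create (but do not send) a JSON POST request for the payload."""
    return urllib.request.Request(
        url=f"{API_BASE}/{MODEL_PATH}",  # assumed URL layout
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )


payload = build_edit_payload(
    prompt="Change the background to daytime.",
    image="https://example.com/input.jpg",
    seed=42,
    output_format="png",
)
request = build_request(payload, api_key="YOUR_API_KEY")
# urllib.request.urlopen(request) would send it once API_BASE points at a real host.
```

Validating output_format before sending keeps an obviously invalid request from being submitted at all.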

How to Use

  1. Write your edit instruction — describe what you want to change in the image.
  2. Use Prompt Enhancer (optional) — click to refine your editing instruction.
  3. Upload your image — drag and drop or paste a public URL.
  4. Set dimensions (optional) — adjust width and height for the output size.
  5. Set seed (optional) — use -1 for random, or a specific number to reproduce results.
  6. Choose output format — select jpeg or png.
  7. Run — click the button to apply the edit.
  8. Download — preview and save your edited image.
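
When the model is called through the API with enable_base64_output, the result arrives as a base64 string rather than a URL, so the download step becomes a decode-and-write step. The following is a minimal sketch of that step; the function name is hypothetical, and the handling of an optional data-URI prefix is an assumption, since this page does not say whether the string includes one.

```python
import base64
import tempfile
from pathlib import Path


def save_base64_image(encoded, destination):
    """Decode a base64 image string and write the bytes to `destination`."""
    # Assumption: the string may carry a data-URI prefix such as
    # "data:image/png;base64,"; strip it if present.
    if encoded.startswith("data:") and "," in encoded:
        encoded = encoded.split(",", 1)[1]
    image_bytes = base64.b64decode(encoded, validate=True)
    path = Path(destination)
    path.write_bytes(image_bytes)
    return path


# Stand-in for an API result: placeholder bytes, not a real image.
sample = base64.b64encode(b"fake-image-bytes").decode("ascii")
out_dir = Path(tempfile.mkdtemp())
saved = save_base64_image(sample, out_dir / "edited.png")
```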

Pricing

Flat rate per edit.

| Output | Cost |
| --- | --- |
| Per image | $0.02 |
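
Because the rate is flat, budgeting is simple arithmetic. The helper names below are hypothetical; the only figure taken from this page is the $0.02 per-image price. Decimal is used so that cent amounts do not pick up floating-point rounding error.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.02")  # USD, from the pricing table


def cost_for_runs(runs):
    """Total cost in USD for a given number of edits."""
    return PRICE_PER_IMAGE * runs


def runs_for_budget(budget_usd):
    """Number of whole edits that fit within a budget in USD."""
    return int(Decimal(str(budget_usd)) // PRICE_PER_IMAGE)


print(runs_for_budget(1))   # 50
print(cost_for_runs(250))   # 5.00
```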

Best Use Cases

  • Text Replacement — Change text on signs, labels, clothing, or any surface in images (supports Chinese & English).
  • Element Modification — Add, remove, or swap objects while preserving untouched areas pixel-perfectly.
  • Style Transfer — Apply new artistic styles while maintaining semantic content.
  • IP Creation — Generate character variations and creative adaptations.
  • Object Manipulation — Rotate, reposition, or transform objects within scenes.
  • Product Customization — Create variations of product images with different details or branding.

Example Prompts

  • "Change the background to daytime"
  • "Replace the text on the sign with 'Welcome'"
  • "Remove the person on the left side of the image"
  • "Change the dress color from red to blue"
  • "Add sunglasses to the person"

Editing Modes Explained

| Mode | Description | Use When |
| --- | --- | --- |
| Appearance Editing | Modifies specific elements while keeping all other regions pixel-accurate and unchanged | Adding/removing objects, color changes, localized edits |
| Semantic Editing | Allows global pixel updates while maintaining semantic intent and consistency | Style transfer, pose changes, IP creation, rotation |

Pro Tips for Best Results

  • Be specific about what to change and what the result should be.
  • For text editing, the model preserves original typography — just specify the new text content.
  • Reference elements clearly: "the text on the shirt", "the background", "the person's hair".
  • For appearance edits, unchanged areas remain pixel-perfect — focus only on what needs editing.
  • Use Chinese prompts for editing Chinese text in images for best accuracy.
  • The model understands context — describe the intent, not just the mechanical change.

Notes

  • If using a URL for the image, ensure it is publicly accessible. A preview thumbnail confirms successful loading.
  • The enable_base64_output and enable_sync_mode options are only available through the API.
  • Complex edits may require more specific prompts for accurate results.
  • For multiple edits to the same image, consider processing sequentially for better control.
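
The sequential approach in the last note can be sketched as a simple chain in which each edit takes the previous output as its input. The `edit` callable below is a hypothetical stand-in for whatever function actually calls the model; here it is replaced by a fake that only records the order of operations.

```python
from typing import Callable, List


def apply_edits_sequentially(image: str, prompts: List[str],
                             edit: Callable[[str, str], str]) -> List[str]:
    """Apply prompts one at a time, feeding each output into the next edit.

    Returns every intermediate output so a bad step can be inspected or redone.
    """
    outputs = []
    current = image
    for prompt in prompts:
        current = edit(current, prompt)
        outputs.append(current)
    return outputs


def fake_edit(image: str, prompt: str) -> str:
    """Stand-in for a real model call: records what was applied to what."""
    return f"{image}+[{prompt}]"


steps = apply_edits_sequentially(
    "input.jpg",
    ["Remove the boy's hat.", "Turn into pixel style."],
    fake_edit,
)
```

Keeping the intermediate outputs makes it possible to rerun only the step that went wrong instead of the whole chain.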