image-to-image

Qwen Image Edit 2511

wavespeed-ai/qwen-image/edit-2511

Qwen Image Edit 2511 is a major upgrade over 2509 for real-world image editing and design. It delivers stronger edit consistency, robust multi-person identity and pose consistency, built-in LoRA styles, enhanced industrial and product design, and improved geometric reasoning for structure-preserving edits. It is built for stable production use, with a ready-to-use REST API, no cold starts, and predictable pricing.

Base64 output: If enabled, the output is returned as a BASE64-encoded string instead of a URL. This option is only available through the API.
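When the BASE64 option is on, the client has to decode the returned string itself. The sketch below shows one way to handle both cases. The function name `output_to_bytes` is hypothetical, and the assumption that an output is either an http(s) URL or a BASE64 string (optionally with a data-URI prefix) is not confirmed by this page.

```python
import base64
import binascii
from typing import Optional


def output_to_bytes(output: str) -> Optional[bytes]:
    """Return decoded image bytes for BASE64 output, or None for URL output."""
    if output.startswith(("http://", "https://")):
        return None  # URL output: download it in a separate step
    # Assumption: a data-URI prefix such as "data:image/png;base64," may be present.
    if output.startswith("data:") and "," in output:
        output = output.split(",", 1)[1]
    try:
        return base64.b64decode(output, validate=True)
    except binascii.Error as exc:
        raise ValueError("output is neither a URL nor valid BASE64") from exc
```

The decoded bytes can be written straight to a file opened in binary mode.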
Sync mode: If set to true, the request waits until the result has been generated and uploaded, then returns it directly in the response. This option is only available through the API.
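When the wait-for-result option is off, the client presumably has to check back for the result on its own. A minimal polling sketch follows. `fetch_status` is a caller-supplied function, and the `status` key with the values `"completed"` and `"failed"` are assumptions for illustration, not taken from this page.

```python
import time
from typing import Callable, Dict


def wait_for_result(
    fetch_status: Callable[[], Dict],
    interval_s: float = 1.0,
    timeout_s: float = 60.0,
) -> Dict:
    """Call `fetch_status` repeatedly until the job finishes or time runs out."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status()
        status = job.get("status")
        if status == "completed":  # assumed status value
            return job
        if status == "failed":  # assumed status value
            raise RuntimeError(f"job failed: {job.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish in time")
        time.sleep(interval_s)
```

Passing the status lookup in as a callable keeps the loop independent of any particular HTTP client.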


Your request will cost $0.02 per run.

For $1 you can run this model approximately 50 times.
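The runs-per-dollar figure is a floor division of the budget by the per-run price. A small sketch, using a hypothetical helper `runs_for_budget` and `Decimal` to avoid floating-point rounding:

```python
from decimal import Decimal


def runs_for_budget(budget_usd: str, price_per_run_usd: str) -> int:
    """Whole number of runs a budget covers at a flat per-run price."""
    budget = Decimal(budget_usd)
    price = Decimal(price_per_run_usd)
    if price <= 0:
        raise ValueError("price must be positive")
    return int(budget // price)


print(runs_for_budget("1", "0.02"))  # 50
```

At the $0.03 per edited image figure listed under Pricing, the same budget covers 33 runs.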

Examples

Make her lean casually against the window with one hand in pocket, wearing an oversized cream knit sweater and blue jeans
Change the comic style to a realistic style for both the people and the environment
Add a banana beside the tomato
Delete all the fruits
Change it to a psychedelic art style
Add color
Draw a triangular plane intersecting all three axes. Label intersection points as A on x-axis, B on y-axis, C on z-axis. Shade the triangle ABC lightly.

README

Qwen-Image-Edit-2511 (20B, MMDiT)

Qwen-Image-Edit-2511 is a high-consistency, production-grade image editing model built on the Qwen-Image 20B (MMDiT) architecture, delivering stronger real-world edits, better identity preservation, and more reliable multi-subject control than earlier releases. It’s designed for fast, prompt-driven edits with stable composition, clean details, and commercial-ready output quality.

What’s new in 2511

  • Stronger multi-person consistency: Handles group photos and multi-subject scenes with better stability and fewer identity swaps.

  • Integrated popular community LoRA styles: Built-in style options for common community aesthetics without extra setup (availability depends on the endpoint).

  • Better industrial & product editing: Cleaner structure, surfaces, and product geometry for design mockups and marketing visuals.

  • Reduced drift across edits: Improved identity and subject consistency when making iterative or larger edits.

  • Improved geometric reasoning: More reliable structural transformations and shape-aware editing.

Core capabilities

  • Dual-mode editing

    • Appearance editing: add/remove/modify elements while keeping other regions visually consistent.
    • Semantic editing: global style/pose/scene transformations that preserve intent while allowing broader pixel changes.
  • Precise text editing (when applicable): Add, delete, or replace on-image text while keeping natural typography behavior (spacing, alignment, style).

  • Style preservation: Maintains lighting, palette, and overall look while applying targeted changes.

Best for

  • Multi-person projects — group photos, team portraits, event shots
  • Industrial & product design — product mockups, packaging tweaks, commercial comps
  • Identity-preserving edits — portraits, characters, avatar refinement
  • Design & marketing teams — fast iterations, brand-safe edits, localization visuals
  • E-commerce & social — product cleanup, background updates, quick visual variations

Example prompts

  • Multi-person: Add a third person matching the existing lighting and camera angle.
  • Industrial: Convert this product into a clean technical blueprint view with construction lines.
  • Identity: Keep the person’s facial features unchanged and replace the background with a modern office.
  • Appearance: Add a latte cup in the top-right corner without changing anything else.
  • Semantic: Restyle the scene as cyberpunk while keeping the brand logo and layout consistent.

Parameters

Parameter | Description
prompt* | The edit instruction describing what to change and what to keep.
images* | Input images to edit or reference, up to 3 in total (the first image is typically treated as the main base image).
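The two parameters above map naturally onto a small request body. The sketch below validates them before anything is sent. `build_payload` is a hypothetical helper, and treating both fields as required is an assumption based on the asterisks.

```python
from typing import Dict, List

MAX_IMAGES = 3  # limit stated in the parameter description


def build_payload(prompt: str, images: List[str]) -> Dict[str, object]:
    """Validate inputs and return a dict with the `prompt` and `images` fields."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if not 1 <= len(images) <= MAX_IMAGES:
        raise ValueError(f"images must contain 1 to {MAX_IMAGES} items")
    # The first image is typically treated as the base image, so order is kept.
    return {"prompt": prompt.strip(), "images": list(images)}
```

Failing early on an empty prompt or a fourth image avoids paying for a run that cannot succeed.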

How to use

  1. Add your base image as the first item in images (you should see a preview in the UI).
  2. Optionally add 1–2 more reference images (maximum 3 total) to guide style, subject details, or composition.
  3. Write a clear prompt describing the edit and constraints (examples: “keep face unchanged”, “keep pose”, “keep background”).
  4. Run the model and review the result.
  5. Iterate by tightening constraints and making one major edit per run for best consistency.
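The steps above can be sketched against the REST API as follows. This is an illustration under stated assumptions: the base URL and the Bearer-token header are assumed rather than taken from this page, and `compose_prompt` and `build_request` are hypothetical helpers. The request is only constructed here, not sent.

```python
import json
import urllib.request
from typing import List

# Assumed endpoint: only the model path comes from this page.
API_URL = "https://api.wavespeed.ai/api/v3/wavespeed-ai/qwen-image/edit-2511"


def compose_prompt(edit: str, keep: List[str]) -> str:
    """One major edit followed by explicit 'keep' constraints."""
    edit = edit.strip().rstrip(".")
    if not keep:
        return edit + "."
    return f"{edit}. Keep " + "; keep ".join(keep) + "."


def build_request(api_key: str, prompt: str, images: List[str]) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one edit run."""
    body = json.dumps({"prompt": prompt, "images": images}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

For example, `compose_prompt("Replace the background with a modern office", ["face unchanged", "pose"])` yields a single edit with two constraints, which matches the one-major-edit-per-run advice in step 5. Passing the request to `urllib.request.urlopen` would submit it.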

Supported output formats typically include JPG / PNG / WEBP (as exposed by the endpoint).

Pricing

  • $0.03 per edited image

Note

If you’re using image URLs (instead of uploading locally), make sure they’re publicly accessible. If the URL is valid, the interface will display a preview before you run the job.
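A quick local pre-check can catch URLs that are clearly not public before a job is submitted. The sketch below is a syntactic filter only: `looks_public` is a hypothetical helper, and passing it does not prove that the service can actually fetch the image.

```python
import ipaddress
from urllib.parse import urlparse


def looks_public(url: str) -> bool:
    """Reject non-http(s) URLs, localhost, and private or loopback IP hosts."""
    parts = urlparse(url)
    host = parts.hostname
    if parts.scheme not in ("http", "https") or not host:
        return False
    if host == "localhost" or host.endswith(".local"):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        return True  # a DNS name; reachability is still unverified
    return ip.is_global
```

URLs behind a login, a VPN, or an expiring signed link will pass this check and still fail at run time, so the preview in the interface remains the more reliable signal.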
