qwen-image/layered (Image-to-RGBA Layers)
qwen-image/layered is a layered image decomposition model that splits a single image into multiple clean RGBA layers, enabling flexible compositing, background separation, and layer-based creative editing. It supports an optional prompt for better semantic separation and explicit control over the number of output layers.
🌟 Why it stands out
- Controllable layer count: choose exactly how many RGBA layers you want via num_layers.
- Clean RGBA outputs: each layer includes transparency for easy compositing and editing.
- Prompt-guided separation: optionally describe the scene to improve layer grouping in complex images.
- Workflow-friendly: ideal for design iteration, asset cleanup, and creative pipelines.
⚙️ Capabilities
- Image-to-multi-layer RGBA decomposition
- Transparent background handling per layer (RGBA output)
- Optional prompt conditioning for semantic grouping
- Works best on images with clear subjects, strong contrast, and limited heavy occlusion
⚙️ Parameters
| Parameter | Description |
|---|
| image* | Input image file or public URL. |
| prompt | Optional caption to guide layer separation (e.g., “a person in front of a building”). |
| num_layers | Number of RGBA layers to generate (e.g., 4). |
💰 Pricing
Reference table:
| num_layers | Total Price |
|---|
| 1 | $0.05 |
| 2 | $0.10 |
| 3 | $0.15 |
| 4 | $0.20 |
| 5 | $0.25 |
| 8 | $0.40 |
How to use
- Upload the source image (or provide a public URL).
- Set num_layers to the number of layers you want.
- Optional: add a prompt to improve semantic separation.
- Run the model.
- Download the RGBA layers and composite/edit them in your workflow.
💡 Best Use Cases
- Layer-based editing and compositing workflows
- Background separation and subject isolation
- Poster / banner / creative layout design
- Rapid asset preparation for marketing and social creatives
📝 Notes
- Best results: clear subjects, good lighting, minimal motion blur, and strong foreground/background separation.
- Higher num_layers can help break complex scenes into smaller components, but may also create finer splits that require selection/merging in post.