Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
Paper
•
2512.15603
•
Published
•
59
6-bit quantized version of Qwen/Qwen-Image-Layered optimized for Apple Silicon via MFLUX.
| Version | Size | VRAM |
|---|---|---|
| BF16 (original) | ~55 GB | ~55 GB |
| 6-bit (this) | ~31 GB | ~29 GB |
Qwen-Image-Layered decomposes any image into multiple RGBA layers with transparency. Each layer can be independently edited, moved, resized, or recolored—enabling high-fidelity, consistent image editing.
Requires MFLUX with Qwen-Image-Layered support (PR #302):
# Install from the PR
git clone https://github.com/zimengxiong/mflux.git
cd mflux
uv sync
uv run flux-generate-qwen-layered \
--image input.png \
--layers 4 \
--steps 50 \
-q 6 \
--output-dir ./layers
Output: 4 RGBA PNG files (layer_0.png, layer_1.png, etc.) with transparency.
| Parameter | Description | Default |
|---|---|---|
--image |
Input image path | Required |
--layers |
Number of layers to decompose | 4 |
--steps |
Inference steps | 50 |
-q |
Quantization (4, 6, or 8-bit) | None (BF16) |
--resolution |
Resolution bucket (640 or 1024) | 640 |
--output-dir |
Output directory for layers | ./ |
Input image:
[layer, height, width])Apache 2.0 (same as original model)
@misc{yin2025qwenimagelayered,
title={Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition},
author={Shengming Yin, Zekai Zhang, Zecheng Tang, Kaiyuan Gao, Xiao Xu, Kun Yan, Jiahao Li, Yilei Chen, Yuxiang Chen, Heung-Yeung Shum, Lionel M. Ni, Jingren Zhou, Junyang Lin, Chenfei Wu},
year={2025},
eprint={2512.15603},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.15603},
}