add support for Qwen Image Pruning #874


Closed
wbruna wants to merge 1 commit into leejet:qwen_image from wbruna:qwen_image_pruning

Conversation

@wbruna (Contributor)

For #851. Allow the model loading logic to tolerate missing layers, which is enough to run the 12B Pruning variant:

https://huggingface.co/OPPOer/Qwen-Image-Pruning

Tested with the Q4_K_M quant from https://huggingface.co/wsbagnsv1/Qwen-Image-Pruning-GGUF :

[attached sample image: teste_1759693079]
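The change the PR describes — tolerating expected tensors that are absent because whole transformer blocks were pruned — could be sketched roughly like this. This is a minimal illustration only: the function names and the exact tensor-name pattern are assumptions, not the actual stable-diffusion.cpp loader code.

```cpp
#include <regex>
#include <set>
#include <string>

// Sketch: pruned Qwen-Image variants drop whole transformer blocks, so any
// tensor under "transformer_blocks.<n>." may legitimately be absent.
static bool is_prunable_tensor(const std::string& name) {
    static const std::regex block_re(R"(transformer_blocks\.\d+\.)");
    return std::regex_search(name, block_re);
}

// Given the tensors the architecture expects and the tensors actually found
// in the file, return only the missing ones that must still be hard errors.
static std::set<std::string> hard_missing(const std::set<std::string>& expected,
                                          const std::set<std::string>& present) {
    std::set<std::string> out;
    for (const std::string& name : expected) {
        if (present.count(name) == 0 && !is_prunable_tensor(name)) {
            out.insert(name);
        }
    }
    return out;
}
```

With a filter like this, the "tensor ... not in model file" errors shown later in this thread for `transformer_blocks.40.*` would be downgraded to tolerated skips, while a missing VAE or text-encoder tensor would still abort loading.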

@wbruna (Contributor, Author)

Quality seems a little worse than the Lightning model, with ~30% less peak VRAM usage, and similar speed gains.

wbruna added a commit to wbruna/llama.cpp that referenced this pull request on Oct 10, 2025
LostRuins pushed a commit to LostRuins/koboldcpp that referenced this pull request on Oct 10, 2025
@wbruna (Contributor, Author)

@leejet , looks like a123e25 from the qwen_image_edit branch is enough to support the '13b' pruned model. Thanks!

The '12b' variant still doesn't work, though, maybe because it has non-contiguous layers. I guess they're keeping the non-pruned layers with the same numbers they have in the original model.
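If that guess is right, a loader would need to derive the set of block indices from the tensor names actually in the file, instead of assuming they run 0..N-1. A rough sketch of such a check (hypothetical helpers, not the actual loader code):

```cpp
#include <regex>
#include <set>
#include <string>
#include <vector>

// Sketch: collect the transformer block indices present in a model file.
// A pruned model that keeps the original numbering yields a non-contiguous
// set (e.g. {0, 1, 5, 17, ...}) rather than 0..N-1.
static std::set<int> present_block_indices(const std::vector<std::string>& names) {
    static const std::regex block_re(R"(transformer_blocks\.(\d+)\.)");
    std::set<int> indices;
    std::smatch m;
    for (const std::string& name : names) {
        if (std::regex_search(name, m, block_re)) {
            indices.insert(std::stoi(m[1].str()));
        }
    }
    return indices;
}

// True if the indices form the contiguous range 0..N-1.
static bool is_contiguous(const std::set<int>& idx) {
    return idx.empty() ||
           (*idx.begin() == 0 &&
            *idx.rbegin() == static_cast<int>(idx.size()) - 1);
}
```

A loader could then size and index its block array by the discovered indices, rather than by a fixed block count.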


wbruna marked this pull request as draft on October 10, 2025 15:43
@LostRuins (Contributor)

@wbruna I think it might be a problem with the GGUF quant, not the model. Compare transformer block 18 in the GGUF https://huggingface.co/wsbagnsv1/Qwen-Image-Pruning-GGUF/tree/main?show_file_info=Qwen-Image-Pruning-12b-Q4_0.gguf

versus the original https://huggingface.co/OPPOer/Qwen-Image-Pruning/tree/main/Qwen-Image-12B/transformer?show_file_info=Qwen-Image-12B%2Ftransformer%2Fdiffusion_pytorch_model-00002-of-00003.safetensors

[two screenshots comparing the tensor listings]

I've reported it to the GGUF quant guy but he doesn't seem to get what I mean: https://huggingface.co/wsbagnsv1/Qwen-Image-Pruning-GGUF/discussions/1


wbruna mentioned this pull request on Oct 12, 2025
@wbruna (Contributor, Author)

> @leejet , looks like a123e25 from the qwen_image_edit branch is enough to support the '13b' pruned model. Thanks!

Not working anymore after d21d1aa (didn't check which revision). I'll retest and update this PR.

@LostRuins (Contributor)

What do you mean by "not working anymore"? Is it still generating an image? It seems to work for me.

wbruna marked this pull request as ready for review on October 12, 2025 10:35
@wbruna (Contributor, Author)

> What do you mean "Not working anymore", is it still generating an image? It seems to work for me

This PR still works; I was referring to my previous comment, which mentioned that a123e25 (from the qwen_image branch itself) made this PR unnecessary.


@leejet (Owner)

Support for a dynamic number of Qwen image transformer blocks is available in the qwen_image_edit branch.


@wbruna (Contributor, Author)

> The support for dynamic number of Qwen image transformer blocks is available in the qwen_image_edit branch.

I tested it with the same model I use to test this PR (the 13b; they renamed it afterwards):

[DEBUG] model.cpp:2088 - loading tensors from /opt/sdif/models/SD/Qwen-Image-Pruning-Q4_K_M.gguf
[DEBUG] model.cpp:2088 - loading tensors from /opt/llm/Qwen2.5-VL-7B-Instruct-IQ4_XS.gguf
[DEBUG] model.cpp:2088 - loading tensors from /opt/sdif/models/VAE/Qwen_Image-VAE.safetensors
[INFO ] model.cpp:2358 - unknown tensor 'first_stage_model.conv1.bias | bf16 | 1 [32, 1, 1, 1, 1]' in model file
[INFO ] model.cpp:2358 - unknown tensor 'first_stage_model.conv1.weight | bf16 | 4 [1, 1, 1, 1024, 1]' in model file
[INFO ] model.cpp:2326 - loading tensors completed, taking 37.90s (process: 0.05s, read: 24.94s, memcpy: 0.00s, convert: 0.47s, copy_to_backend: 12.08s)
[ERROR] model.cpp:2399 - tensor 'model.diffusion_model.transformer_blocks.40.attn.add_k_proj.bias' not in model file
[ERROR] model.cpp:2399 - tensor 'model.diffusion_model.transformer_blocks.40.attn.add_k_proj.weight' not in model file
[ERROR] model.cpp:2399 - tensor 'model.diffusion_model.transformer_blocks.40.attn.add_q_proj.bias' not in model file
[ERROR] model.cpp:2399 - tensor 'model.diffusion_model.transformer_blocks.40.attn.add_q_proj.weight' not in model file
[ERROR] model.cpp:2399 - tensor 'model.diffusion_model.transformer_blocks.40.attn.add_v_proj.bias' not in model file
(...)

@leejet (Owner)

Please use the latest code from this PR: #877.


@wbruna (Contributor, Author)

Oh, I see: I forgot that rev was on the qwen_image_edit branch 🤦

@LostRuins , btw: if you're already syncing with that one, feel free to drop my changes, they shouldn't be necessary.


wbruna marked this pull request as draft on October 12, 2025 11:09
leejet deleted the branch leejet:qwen_image on October 12, 2025 16:23
leejet closed this on Oct 12, 2025
@leejet (Owner)

I noticed that this PR was automatically closed after the qwen_image branch was deleted.
If the related changes are still relevant, please feel free to reopen it for further review.


@wbruna (Contributor, Author)

> I noticed that this PR was automatically closed after the qwen_image branch was deleted. If the related changes are still relevant, please feel free to reopen it for further review.

Thanks. I was keeping this open just to further investigate the 12b variant, but I could just open another one instead.



3 participants: @wbruna, @LostRuins, @leejet
