Pull requests: vllm-project/llm-compressor (Public)

39 Open 1,146 Closed

Add support for passing a custom DataLoader to oneshot()

#2390 opened Feb 20, 2026 by SorenDreano

Ddp v3

#2389 opened Feb 19, 2026 by HDCharles

perf: make MSE observer compatible with torch.compile (dual-path implementation)

ready

When a PR is ready for review

#2384 opened Feb 18, 2026 by Bias92

feat: add Qwen3.5 MoE calibration module

documentation

Improvements or additions to documentation

nvfp4

For any PR / issue related to NVFP4 support

qwen

For any PR / issue related to Qwen support

ready

#2383 opened Feb 18, 2026 by Sehyo

[Docs] Reorganize + Additional Guides

documentation

ready

#2379 opened Feb 17, 2026 by dsikka

[Oneshot] Add validation for empty dataset and enhance oneshot function parameters (Supersedes PR #1957)

needs-rebase

#2378 opened Feb 17, 2026 by ArkaSanka • Draft

[Qwen3.5 MoE Support]

documentation

quality-failed

#2377 opened Feb 17, 2026 by dsikka • Draft

[Offloading] Support Disk Offloading

documentation

ready

#2373 opened Feb 17, 2026 by kylesayrs

[GPTQ] Move modifier to top-level for consistent folder structure

documentation

needs-rebase

ready

#2368 opened Feb 16, 2026 by dik654

add qwen3 vl autoround example

documentation

ready

#2357 opened Feb 12, 2026 by xin3he

Add model_free_ptq example for glm 4.6 block fp8

documentation

#2343 opened Feb 10, 2026 by mgoin

Improve how we identify and run e2e smoke tests

#2336 opened Feb 6, 2026 by dhuangnm

[MoE] MiniMax-M2/M2.1 calibration follow-up

documentation

ready

#2335 opened Feb 6, 2026 by LudovicoYIN

[AutoRound] Add DP Support

#2331 opened Feb 5, 2026 by yiliu30

Add GSM8K evaluation script and AWQ+FP8 results

documentation

ready

#2330 opened Feb 4, 2026 by rtj1

Update CODEOWNERS to include modifiers

#2329 opened Feb 4, 2026 by dsikka • Draft

[AWQ] Add option to consider smooth layer quantization in scale search

needs-rebase

#2323 opened Jan 31, 2026 by Ramshankar07

Benchmark torch.compile optimization for GPTQ

ready

#2320 opened Jan 31, 2026 by colldata79

Add AFMOE mappings for awq and smoothquant

needs-rebase

ready

#2316 opened Jan 30, 2026 by bartowski1182

Refactor Matching Logic to Use compressed-tensors Utilities

needs-rebase

ready

#2284 opened Jan 24, 2026 by Etelis

[Docs][Examples] Add MoE Guide and remove finetune examples

documentation

needs-rebase

ready

#2281 opened Jan 23, 2026 by dsikka

[WIP][Examples] model_free_ptq of nvidia/DeepSeek-R1-NVFP4

documentation

#2228 opened Jan 13, 2026 by brian-dellabetta • Draft

5 of 6 tasks

[AWQ] Option to disable quantization

awq

For any issue / PR related to AWQ support

documentation

needs-rebase

ready

#2206 opened Jan 8, 2026 by brian-dellabetta

9 of 10 tasks

[WIP] [Audio] Qwen2 Audio Example

#2177 opened Dec 31, 2025 by kylesayrs • Draft

Refactor gpt oss quantization use all expert quantization

documentation

ready

#2164 opened Dec 21, 2025 by saraswatmks
