Movatterモバイル変換

vllm-project/vllmPublic

NotificationsYou must be signed in to change notification settings
Fork8.8k
Star52.6k

New pull requestNew

818 Open 10,112 Closed

Author

Label

Projects

Milestones

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Sort

[Bugfix][ROCm] Fix for warp_size uses on host rocm

Related to AMD ROCm

#21205 openedJul 18, 2025 bygshtras

Loading…

Add VLLM_DISTRIBUTED_INIT_TIMEOUT_SECONDS to set torch.distributed timeouts

#21203 openedJul 18, 2025 bytlrmchlsmth

Loading…

[Bugfix] Correct input_len/prefix_len for RandomDataset in benchmarking performance

Performance-related issues

#21202 openedJul 18, 2025 byericehanley

Loading…

1 task

ci: Add CUDA + arm64 release builds ci/build

#21201 openedJul 18, 2025 byseemethere

Loading…

2 of 4 tasks

[BugFix][CPU] FixTorchSDPABackendImpl doesn't haveuse_irope ready

ONLY add when PR is ready to merge/full CI is needed

#21200 openedJul 18, 2025 byLucasWilkinson

Loading…

v0.10.0

[Misc] Add dummy maverick test

#21199 openedJul 18, 2025 byminosfuture • Draft

[Compilation fix] add stubs to allow compilation without sm100

#21198 openedJul 18, 2025 bymickaelseznec

Loading…

4 tasks

[Kernel] Enable Hybrid Model Support in Triton Unified Attention Kernel v1

#21197 openedJul 18, 2025 byjvlunteren

Loading…

[BugFix] Fix potential cuda-graph IMA bug

Something isn't working

ready

ONLY add when PR is ready to merge/full CI is needed

#21196 openedJul 18, 2025 byLucasWilkinson

Loading…

3 of 4 tasks

v0.10.0

[CI/Build] fix cpu_extension for apple silicon ci/build

#21195 openedJul 18, 2025 byignaciosica

Loading…

[V1] [Hybrid] Enable piecewise CUDA Graph for mamba layers v1

#21194 openedJul 18, 2025 bytdoublep

Loading…

5 of 6 tasks

[Kernel][Performance] Tweak MoE Batched silu_mul_fp8_quant_deep_gemm kernel ready

ONLY add when PR is ready to merge/full CI is needed

#21193 openedJul 18, 2025 byvarun-sundar-rabindranath

Loading…

[Docs] Update Tensorizer usage documentation documentation

Improvements or additions to documentation

#21190 openedJul 18, 2025 bysangstar

Loading…

[W.I.P]: add Lmcache metrics v1

#21189 openedJul 18, 2025 bypanpan0000

Loading…

1 of 4 tasks

[Attention] Clean up iRoPE in V1 ready

ONLY add when PR is ready to merge/full CI is needed

tpu

Related to Google TPUs

#21188 openedJul 18, 2025 byLucasWilkinson

Loading…

3 of 4 tasks

v0.10.0

[Bug] DeepGemm: Fix TypeError: per_block_cast_to_fp8() missing 1 required positional argument: 'use_ue8m0' for SM100 bug

Something isn't working

ready

ONLY add when PR is ready to merge/full CI is needed

#21187 openedJul 18, 2025 byyewentao256

Loading…

Some initial Vulkan boilerplate ci/build

#21184 openedJul 18, 2025 byericcurtin

Loading…

[Bugfix][Model] Fix LoRA for Mistral-Small-3.1-24B-Instruct-2503 bug

Something isn't working

multi-modality

Related to multi-modality (#4194)

ready

ONLY add when PR is ready to merge/full CI is needed

#21183 openedJul 18, 2025 byvarun-sundar-rabindranath

Loading…

v0.10.0

[Bugfix] V1 Fix the cursor leakage issue during request scheduling. v1

#21173 openedJul 18, 2025 byCLFutureX

Loading…

[Bugfix] Fixed the missing metrics in output frontend v1

#21171 openedJul 18, 2025 byhsliuustc

Loading…

3 of 4 tasks

[V0 deprecation] Remove long context LoRA ready

ONLY add when PR is ready to merge/full CI is needed

tpu

Related to Google TPUs

#21169 openedJul 18, 2025 byjeejeelee

Loading…

4 tasks

[Feature][EPLB] Add support for unquantized models

#21168 openedJul 18, 2025 byhsliuustc

Loading…

3 of 4 tasks

[Bugfix] Mistral crashes on tool with no description

#21167 openedJul 18, 2025 byHugoMichard

Loading…

[Feature][OCP MX] Support mxfp6 and mixed mxfp6-mxfp4

#21166 openedJul 18, 2025 byfxmarty-amd

Loading…

2 tasks

[feat] move WEIGHT_SCALE_SUPPORTED into raise block

#21164 openedJul 18, 2025 byweixiao-huang

Loading…

ProTip! Typegp on any issue or pull request to go back to the pull request listing page.

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Movatterモバイル変換

Uh oh!

Pull requests: vllm-project/vllm

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list