Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Usealt +click/return to exclude labels
or +click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobodyLoading
Sort

Pull requests list

[Bugfix][ROCm] Fix for warp_size uses on host rocmRelated to AMD ROCm
#21205 openedJul 18, 2025 bygshtrasLoading…
[Bugfix] Correct input_len/prefix_len for RandomDataset in benchmarking performancePerformance-related issues
#21202 openedJul 18, 2025 byericehanleyLoading…
1 task
ci: Add CUDA + arm64 release builds ci/build
#21201 openedJul 18, 2025 byseemethereLoading…
2 of 4 tasks
[BugFix][CPU] FixTorchSDPABackendImpl doesn't haveuse_irope readyONLY add when PR is ready to merge/full CI is needed v1
#21200 openedJul 18, 2025 byLucasWilkinsonLoading… v0.10.0
[Misc] Add dummy maverick test
#21199 openedJul 18, 2025 byminosfuture Draft
[BugFix] Fix potential cuda-graph IMA bugSomething isn't working readyONLY add when PR is ready to merge/full CI is needed v1
#21196 openedJul 18, 2025 byLucasWilkinsonLoading…
3 of 4 tasks
v0.10.0
[V1] [Hybrid] Enable piecewise CUDA Graph for mamba layers v1
#21194 openedJul 18, 2025 bytdoublepLoading…
5 of 6 tasks
[Kernel][Performance] Tweak MoE Batched silu_mul_fp8_quant_deep_gemm kernel readyONLY add when PR is ready to merge/full CI is needed
#21193 openedJul 18, 2025 byvarun-sundar-rabindranathLoading…
[Docs] Update Tensorizer usage documentation documentationImprovements or additions to documentation
#21190 openedJul 18, 2025 bysangstarLoading…
[W.I.P]: add Lmcache metrics v1
#21189 openedJul 18, 2025 bypanpan0000Loading…
1 of 4 tasks
[Attention] Clean up iRoPE in V1 readyONLY add when PR is ready to merge/full CI is needed tpuRelated to Google TPUs v1
#21188 openedJul 18, 2025 byLucasWilkinsonLoading…
3 of 4 tasks
v0.10.0
[Bug] DeepGemm: Fix TypeError: per_block_cast_to_fp8() missing 1 required positional argument: 'use_ue8m0' for SM100 bugSomething isn't working readyONLY add when PR is ready to merge/full CI is needed
#21187 openedJul 18, 2025 byyewentao256Loading…
Some initial Vulkan boilerplate ci/build
#21184 openedJul 18, 2025 byericcurtinLoading…
[Bugfix][Model] Fix LoRA for Mistral-Small-3.1-24B-Instruct-2503 bugSomething isn't working multi-modalityRelated to multi-modality (#4194) readyONLY add when PR is ready to merge/full CI is needed
#21183 openedJul 18, 2025 byvarun-sundar-rabindranathLoading… v0.10.0
[Bugfix] Fixed the missing metrics in output frontend v1
#21171 openedJul 18, 2025 byhsliuustcLoading…
3 of 4 tasks
[V0 deprecation] Remove long context LoRA readyONLY add when PR is ready to merge/full CI is needed tpuRelated to Google TPUs
#21169 openedJul 18, 2025 byjeejeeleeLoading…
4 tasks
[Feature][EPLB] Add support for unquantized models
#21168 openedJul 18, 2025 byhsliuustcLoading…
3 of 4 tasks
[Feature][OCP MX] Support mxfp6 and mixed mxfp6-mxfp4
#21166 openedJul 18, 2025 byfxmarty-amdLoading…
2 tasks
Previous13453233
Previous
ProTip! Typegp on any issue or pull request to go back to the pull request listing page.

[8]ページ先頭

©2009-2025 Movatter.jp