- Notifications
You must be signed in to change notification settings - Fork1.9k
Pull requests: NVIDIA/TensorRT-LLM
Author
Uh oh!
There was an error while loading.Please reload this page.
Label
Uh oh!
There was an error while loading.Please reload this page.
Projects
Uh oh!
There was an error while loading.Please reload this page.
Milestones
Uh oh!
There was an error while loading.Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading.Please reload this page.
Sort
Pull requests list
[https://nvbugs/5651854][fix] Fix dist-serving perf by clearing CPU affinity
#9549 openedNov 29, 2025 byShixiaowei02Loading…
[TRTLLM-9488][feat] use FlashInfer.sampling by default
#9545 openedNov 28, 2025 byixlmarLoading…
1 task done
[None][feat] add chat template kwargs support to longbench-v2
#9544 openedNov 28, 2025 bylfr-0531Loading…
1 task done
[TRTLLM-6222][feat] Extend cute_dsl_nvfp4_gemm to sm103.
#9543 openedNov 28, 2025 bylimin2021Loading…
1 task done
[None][fix] Skip Allreduce init for Attention DP Release BlockerPRs that blocking the final release build or branching out the release branch
#9542 openedNov 28, 2025 bysyuoniLoading…
1 task done
[None][fix] Option #2 Introduce inline namespace to avoid symbol collision
#9541 openedNov 28, 2025 byyihwang-nv • Draft
[None][feat] Update Qwen3CodeToolParser to align tool-calling parameters
#9540 openedNov 28, 2025 byWanli-JiangLoading…
1 task done
[https://nvbugs/5651854][fix] revert #8805 to fix disagg perf issue
#9536 openedNov 28, 2025 byreasonsoloLoading…
1 task done
[TRTLLM-9391][chore] Automatically estimate required workspace.
#9535 openedNov 28, 2025 bybobboliLoading…
1 task
[None][fix] Add a timeout in MNNVL throughput to prevent hangs if one rank crashes
#9532 openedNov 28, 2025 bydjns99Loading…
1 task done
[https://nvbugs/5690172][fix] Fix Qwen3-235B ATP accuracy issue with PDL
#9530 openedNov 28, 2025 bysyuoniLoading…
1 task done
[None][infra] - Request idle time exemption for OCI jobs
#9528 openedNov 27, 2025 bychzblychLoading…
1 task done
[#9150][feat] AutoDeploy: reviewer comments for #9150
#9527 openedNov 27, 2025 bylucaslieLoading…
1 task
[TRTLLM-9242][doc] Add examples showcasing openai compatible APIs
#9520 openedNov 27, 2025 byJunyiXu-nvLoading…
1 task done
[https://nvbugs/5652062][fix] Rectify the checking rule for finishing a request
#9516 openedNov 27, 2025 byziyixiong-nvLoading…
1 task
[None] [feat] add eos_token_id in generation_config to sampling params
#9514 openedNov 27, 2025 byJadoTuLoading…
1 task done
[https://nvbugs/5666804][test] only adding sampler config for limited models
#9512 openedNov 27, 2025 byruodilLoading…
1 task done
[https://nvbugs/5527655][test] Add test case for RCCA 5527655
#9511 openedNov 27, 2025 byfredricz-20070104Loading…
[None][feat] update trtllm-gen nvfp4 kernels with better performance
#9510 openedNov 27, 2025 byPerkzZhengLoading…
1 task done
ProTip! Typegi on any issue or pull request to go back to the issue listing page.