Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: NVIDIA/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Usealt +click/return to exclude labels
or +click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobodyLoading
Sort

Pull requests list

ci: Use whitelisted sha forget-release
#2531 openedDec 18, 2025 byko3n1gLoading…
13 tasks
[JAX] Fix incorrect calculation of segment pos from segment ids attention bugSomething isn't working jax
#2523 openedDec 16, 2025 byKshitijLakhaniLoading…
5 of 13 tasks
Documentation for cpu offloading documentationImprovements or additions to documentation
#2520 openedDec 16, 2025 bypggPLLoading…
8 of 13 tasks
[PyTorch] Support cudagraph recomputation
#2518 openedDec 16, 2025 bybuptzybLoading…
1 of 13 tasks
[JAX] HLO FFI tests jax
#2517 openedDec 16, 2025 byjberchtold-nvidiaLoading…
7 of 13 tasks
Cpu optimizations v2 cpu_overhead
#2514 openedDec 12, 2025 byvthumbe1503 Draft
13 tasks
[Common] Optimize fused RoPE kernel performance performancePerformance issues
#2508 openedDec 11, 2025 byyaox12 Draft
13 tasks
[common] Add support for cuBLASLt GEMM for GroupedTensor MoE
#2502 openedDec 10, 2025 bypggPLLoading…
8 tasks done
Add logic for block-scaled tensors with GEMM swizzled scales enhancementNew feature or request MoE performancePerformance issues refactor
#2486 openedDec 6, 2025 bytimmoon10Loading…
14 of 19 tasks
[JAX] Einsum with quantization
#2474 openedDec 3, 2025 byphu0ngng Draft
13 tasks
[PyTorch] Documentation for op fuser API documentationImprovements or additions to documentation
#2447 openedDec 3, 2025 bytimmoon10Loading…
8 of 13 tasks
[PyTorch] Enable post-RHT amax estimation fp4
#2442 openedDec 2, 2025 bynegvet Draft
1 of 13 tasks
support cuda graph capture offloading module
#2435 openedDec 1, 2025 bylhb8125 Draft
13 tasks
[PyTorch] Add FA4 Support
#2432 openedNov 28, 2025 byyaox12 Draft
1 of 16 tasks
Previous134
Previous
ProTip! Typegp on any issue or pull request to go back to the pull request listing page.

[8]ページ先頭

©2009-2025 Movatter.jp