Pull requests: NVIDIA/Megatron-LM

Pull requests list

Add megatron_tokenizer and fix distrib_optimizer
#3521 opened Feb 20, 2026 by shanmugamr1992
6 tasks
Nemo-RL Refit
#3520 opened Feb 20, 2026 by wdykas (Draft)
6 tasks
Implement forced lag in RL
#3517 opened Feb 20, 2026 by tdene (Draft)
6 tasks
Track and plot per-token off-policy in RL
Labels: [complexity: medium] [Expert Review]
#3515 opened Feb 20, 2026 by tdene
6 tasks
Core 0.16
Fix Megatron-FSDP optimizer state DCP checkpointing, and fix DTensor deepcopy bug from PyTorch 26.01.
Labels: [bug] [Expert Review] [module: megatron-fsdp]
#3510 opened Feb 20, 2026 by cspades
3 of 6 tasks
Core 0.16
@cspades
Change the cudagraph distribution from linearly to exponentially-decreasing
Labels: [complexity: low] [Expert Review]
#3509 opened Feb 20, 2026 by mathemakitten
6 tasks
Core 0.16
Multimodal: fix model provider
Labels: [complexity: low] [Expert Review]
#3508 opened Feb 20, 2026 by faradawn
1 of 6 tasks
Multimodal: Fix training script to enable multimodal tokenizer and fix Triton Cache Manager patch
Labels: [complexity: low] [Expert Review]
#3507 opened Feb 20, 2026 by faradawn
1 of 6 tasks
docs: Update docs for 0.16.0
Labels: [complexity: low] [Expert Review]
#3505 opened Feb 19, 2026 by chtruong814
6 tasks
Core 0.16
Fixed fp32 residuals
Labels: [Expert Review]
#3504 opened Feb 19, 2026 by mkhona-nvidia
6 tasks
Core 0.16
Fix documented shape
Labels: [complexity: low] [docs-only] [Expert Review]
#3486 opened Feb 19, 2026 by janEbert
1 of 6 tasks
Core 0.16
Mmiranda attempt fix build errors
Labels: [docs-only]
#3479 opened Feb 18, 2026 by megnvidia
6 tasks
Automatically add review label
Labels: [complexity: medium] [Expert Review]
#3478 opened Feb 18, 2026 by Phlip79
Core 0.16
Remove redundant CUDA calls in the LLaVA dataloader
Labels: [Final Review]
#3476 opened Feb 18, 2026 by duncanriach
6 tasks
Core 0.16
Add httpx boilerplate to RL OpenAI connections
Labels: [complexity: low] [Expert Review]
#3475 opened Feb 18, 2026 by tdene
6 tasks
Core 0.16
build: Bump major dependencies
Labels: [complexity: low]
#3474 opened Feb 18, 2026 by ko3n1g
6 tasks
Core 0.16
remove attn mask from seqPack
Labels: [complexity: low]
#3471 opened Feb 18, 2026 by jalbericiola
6 tasks
Multimodal: add tokenizer path
Labels: [complexity: low] [Expert Review]
#3466 opened Feb 18, 2026 by faradawn
1 of 6 tasks
Multimodal: fix VQA dataset selection
Labels: [complexity: low] [docs-only] [Expert Review]
#3464 opened Feb 17, 2026 by faradawn
1 of 6 tasks
Fix memory issue in mxfp8 model init
Labels: [community-request] [complexity: low] [Final Review]
#3461 opened Feb 17, 2026 by WanZzzzzz
2 of 6 tasks
Core 0.16
Add separate mtp_grad_scale_func for MTP loss scaling
Labels: [complexity: low] [Expert Review]
#3459 opened Feb 17, 2026 by yfw
6 tasks
Add MTP acceptance rate metrics
#3458 opened Feb 17, 2026 by yfw (Draft)
6 tasks