- Notifications
You must be signed in to change notification settings - Fork3.1k
Pull requests: PaddlePaddle/PaddleNLP
Author
Uh oh!
There was an error while loading.Please reload this page.
Label
Uh oh!
There was an error while loading.Please reload this page.
Projects
Uh oh!
There was an error while loading.Please reload this page.
Milestones
Uh oh!
There was an error while loading.Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading.Please reload this page.
Sort
Pull requests list
Normalize gates on expert dim before calculating seq_aux_loss
#11160 openedNov 3, 2025 bylshpkuLoading…
【FlexCheckpoint】fix_the_optimizer_init contributor stale
#11123 openedSep 27, 2025 byzty-kingLoading…
2 tasks
hack offload optimizer减少一次master weight的offload&reload stale
#11111 openedSep 23, 2025 byWennie396Loading…
add script for training gpt3 on XPU machine using flagcx as comm backend contributor stale
#11014 openedAug 26, 2025 bymikethegoblinLoading…
2 tasks
[NOT MERGE]Pr adapt flex checkpoint contributor stale
#10996 openedAug 25, 2025 byzty-kingLoading…
2 tasks
[BUG]: fix the bug in PretrainedModel.recompute_disable() contributor stale
#10988 openedAug 21, 2025 byhongjx175Loading…
2 tasks
recompute support offload tensor stale
#10981 openedAug 21, 2025 byblacksheep-AristotleLoading…
2 tasks
moe_layer support fine_grained_forward stale
#10980 openedAug 21, 2025 byblacksheep-AristotleLoading…
2 tasks
update expert parallel init logic stale
#10966 openedAug 18, 2025 byblacksheep-AristotleLoading…
2 tasks
ProTip! Find all pull requests that aren't related to any open issues with-linked:issue.