Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: huggingface/open-r1

Author
Filter by author
Loading
Label
Filter by label
Loading
Usealt +click/return to exclude labels
or +click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobodyLoading
Sort

Pull requests list

regular update
#641 openedMay 14, 2025 byweijiang2023Loading…
Add WIP code GRPO configs
#593 openedApr 11, 2025 byedbeeching Draft
[WIP] R1-Zero-like experiments
#569 openedApr 1, 2025 bylewtun Draft
Agent Traces Pipeline
#565 openedMar 31, 2025 bybaptistecolle Draft
Configurable reward functions
#552 openedMar 27, 2025 byesnibleLoading…
Resolve double BOS token issue
#462 openedMar 3, 2025 byeldarkurticLoading…
translate readme to Chinese(traditional)
#432 openedFeb 25, 2025 byJillChen525Loading…
Start agent traces
#414 openedFeb 24, 2025 byaymeric-roucher Draft
Fix dataset url
#347 openedFeb 17, 2025 byZzhiterLoading…
fix bug, solutions not found
#334 openedFeb 15, 2025 byhellen9527Loading…
Update sglang README.md
#330 openedFeb 15, 2025 byyh-yaoLoading…
fix: sft fix
#307 openedFeb 13, 2025 bypointerhackerLoading…
Fix eval max length
#297 openedFeb 12, 2025 bySome-randomLoading…
[rewards] use dense rep penalty
#296 openedFeb 12, 2025 bykashifLoading…
Update README.md
#291 openedFeb 12, 2025 bytpoisonoooLoading…
Previous1
Previous
ProTip!no:milestone will show everything without a milestone.

[8]ページ先頭

©2009-2025 Movatter.jp