- Notifications
You must be signed in to change notification settings - Fork2.3k
Start agent traces#414
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
Draft
aymeric-roucher wants to merge88 commits intomainChoose a base branch fromagent-traces
base:main
Could not load branches
Branch not found:{{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline, and old review comments may become outdated.
Uh oh!
There was an error while loading.Please reload this page.
Draft
Changes from1 commit
Commits
Show all changes
88 commits Select commitHold shift + click to select a range
352008b
Start agent traces
aymeric-roucher6c231d2
Working local version with o1
aymeric-roucher69b2651
Update api addr
aymeric-roucherad948c2
Increase concurrent requests
aymeric-rouchera00f0ee
Update sbatch params
aymeric-roucher143fcfa
Add conda activation
aymeric-roucher0af9e75
Use local model
aymeric-roucher6cffffe
128 concurrent
aymeric-rouchercf13c2b
Log
aymeric-rouchercffa362
Add conda init
aymeric-rouchere35800c
Fix slurm script
aymeric-roucherb47a4be
Add await
aymeric-roucher0cd0999
Try fixing async func
aymeric-roucherdd15ad8
Add stop sequences
aymeric-roucherd2588cd
Add port
aymeric-roucherb738e58
Make synchronous
aymeric-roucherf78b865
Small adapts to script
aymeric-rouchercb2a2c2
More detailed error logging
aymeric-roucher9a2d16f
Even more detailed request error logging
aymeric-roucher2a1ff76
Reduce context length
aymeric-rouchera97eb27
Add token counting
aymeric-roucherd8cb19b
Fix message roles an add token counting
aymeric-rouchere42b1cd
Add dummy completion
aymeric-roucher83a679f
Test
aymeric-roucherd87e3f3
Running with gpt-4o
aymeric-roucher8e70ca4
Update timeouts
aymeric-roucher2876d52
Adjust
aymeric-rouchercf52433
Flatten messages
aymeric-rouchera07cd54
Prompt more around testing the function
aymeric-roucherddc1cdd
Improve explanations in prompt
aymeric-roucher4c2fce6
Also store final outputs
aymeric-roucher4a20ba4
Try Qwen Coder 32B
aymeric-roucher6961c36
Remove some dependencies to work on mac
aymeric-roucher2b1bc05
Merge branch 'main' into agent-traces
aymeric-roucher38efcfc
Working trace generation with auto verification by running test cases
aymeric-roucherb7522e3
Add training scripts for agents
aymeric-roucher2ddf70e
Change job name
aymeric-roucher49083cc
Intervert sft training configs
aymeric-roucherde2b792
Point to proper config file
aymeric-roucher5647c26
Add distributed type
aymeric-roucher8a7951c
Revert to zero3 config
aymeric-roucherd28d07b
Remove deepspeed config
aymeric-rouchercae3c7c
Update train slurm
aymeric-roucher2a08444
Switch to new venv
aymeric-roucher1eaf1d1
Move script to proper file
aymeric-roucher2043be9
Change job name
aymeric-roucher2030e16
Increase epochs
aymeric-roucher08a449c
Update dataset name
aymeric-roucher60472f6
Increase epochs
aymeric-roucher9347590
adding qwen 3b training setup
a66a5e6
Merge branch 'main' into agent-traces
aymeric-rouchera9b5411
Add aguvis download script
aymeric-roucher80f7ce8
Improve collection script
aymeric-roucher984d631
Add Readme for agents
aymeric-rouchera675552
Fix env variables
aymeric-roucherfbd987c
Remove weka
aymeric-roucher3aee6ef
Modify train slurm
aymeric-roucher7cb592c
Remove parsing
aymeric-roucherb7a700e
Revert training script to the good old time when it worked
aymeric-roucher3b77977
Revert to new shitty script
aymeric-roucher81c64ac
Change weka path
aymeric-rouchereb39096
Try edit
aymeric-roucher0ee52fc
Fix env
aymeric-roucherc4d4126
Working SFT for text model
aymeric-roucher3c3e954
Start adapting script for VLM training
aymeric-rouchera452f2f
Impreove data collection script
aymeric-roucher5fa7e51
Deactivate multinodes
aymeric-roucher933ea92
Merge branch 'agent-traces' of github.com:huggingface/open-r1 into ag…
aymeric-rouchera658db9
Fix sft collate function for vlms
aymeric-roucher24ea112
Fix collate fn in sft.py
aymeric-roucherdb30467
Working VLM training 🥳
aymeric-roucher5eadb06
Add single-GPU training script
aymeric-roucherb316210
Add second dataset in mix
aymeric-roucher2ba1c65
Add aguvis conversion script
aymeric-roucherf6b8f7c
Conversion script
aymeric-roucher22b84cf
Merge branch 'agent-traces' of github.com:huggingface/open-r1 into ag…
aymeric-roucher035f134
Integrate aguvis conversion to smolagents
aymeric-roucherf692c10
Try catch wrap for processing
aymeric-roucherb0d794c
override existing split
aymeric-roucher31cf3a2
Nit script args
aymeric-roucher029dc60
Update train instructions
aymeric-roucher880a585
Merge branch 'agent-traces' of github.com:huggingface/open-r1 into ag…
aymeric-roucher1b50860
Remove merge artifact
aymeric-roucher868d4a4
Small fixes in recipe
aymeric-roucher4c83688
Modify aguvis conversion script
aymeric-roucher6a63f2f
Unify conversion in only one script
aymeric-rouchere8a4c2b
Update imports
aymeric-roucher18fea48
Fix script
aymeric-roucherFile filter
Filter by extension
Conversations
Failed to load comments.
Loading
Uh oh!
There was an error while loading.Please reload this page.
Jump to
Jump to file
Failed to load files.
Loading
Uh oh!
There was an error while loading.Please reload this page.
Diff view
Diff view
Start adapting script for VLM training
- Loading branch information
Uh oh!
There was an error while loading.Please reload this page.
commit3c3e9545781eb5a556011221c94fce4374829c70
There are no files selected for viewing
14 changes: 13 additions & 1 deletionREADME_AGENTS.md
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
Empty file removedlogs/.gitkeep
Empty file.
15 changes: 14 additions & 1 deletionrecipes/Qwen2.5-VL-3B-Instruct/sft/config_agent.yaml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletionrecipes/SmolLM2-1.7B-Instruct/sft/config_agent.yaml
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
12 changes: 4 additions & 8 deletionsslurm/train.slurm
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
4 changes: 4 additions & 0 deletionssrc/open_r1/configs.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
83 changes: 69 additions & 14 deletionssrc/open_r1/sft.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
4 changes: 2 additions & 2 deletionssrc/open_r1/utils/__init__.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
from .data import get_dataset | ||
from .import_utils import is_e2b_available, is_morph_available | ||
from .model_utils import get_model, get_tokenizer, get_processor | ||
__all__ = ["get_tokenizer", "get_processor", "is_e2b_available", "is_morph_available", "get_model", "get_dataset"] |
39 changes: 32 additions & 7 deletionssrc/open_r1/utils/model_utils.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
Oops, something went wrong.
Uh oh!
There was an error while loading.Please reload this page.
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.