Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[DO NOT MERGE] SFT configs for Qwen coder models#438

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Draft
edbeeching wants to merge23 commits intomain
base:main
Choose a base branch
Loading
fromqwen-coder-sft-configs
Draft
Changes from1 commit
Commits
Show all changes
23 commits
Select commitHold shift + click to select a range
56f9257
configs
edbeechingFeb 26, 2025
dae6e9a
Merge branch 'main' into qwen-coder-sft-configs
lewtunMar 1, 2025
2080600
Add codeforces recipes
lewtunMar 1, 2025
a6f44b2
Add v06
lewtunMar 2, 2025
d9b7074
Merge branch 'main' into qwen-coder-sft-configs
lewtunMar 3, 2025
ba27a99
Add v07
lewtunMar 3, 2025
bc281a2
Merge branch 'main' into qwen-coder-sft-configs
lewtunMar 4, 2025
f82658d
Add v08
lewtunMar 5, 2025
0523624
Add 32B recipe
lewtunMar 9, 2025
6bab2d8
Disable Liger
lewtunMar 9, 2025
4bb2495
Add fsdp
lewtunMar 9, 2025
a6b8da7
Fix optim
lewtunMar 10, 2025
4daec5a
Align ds configs
lewtunMar 10, 2025
6dab011
Reveett
lewtunMar 10, 2025
8677506
fix
lewtunMar 10, 2025
83c271b
Reduce context for OOM
lewtunMar 10, 2025
7e1bd37
Add IOI configs
lewtunMar 10, 2025
887dbe9
Add moar recipes
lewtunMar 12, 2025
eb2a3aa
Add QwQ
lewtunMar 12, 2025
0fdad2f
Tune recipe
lewtunMar 13, 2025
abee7a2
Add v11-v13 ablations
lewtunMar 16, 2025
be64fef
Add long and short recipes
lewtunMar 23, 2025
43dd44e
Add OlympicCoder 3B
lewtunMar 31, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
PrevPrevious commit
NextNext commit
Add v06
  • Loading branch information
@lewtun
lewtun committedMar 2, 2025
commita6f44b2b0afbbb7040fdfbfb48e0a803033c35da
51 changes: 51 additions & 0 deletionsrecipes/Qwen2.5-Coder-7B-Instruct/sft/config_v06.00.yaml
View file
Open in desktop
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
# Model arguments
model_name_or_path: Qwen/Qwen2.5-Coder-7B-Instruct
model_revision: main
torch_dtype: bfloat16
attn_implementation: flash_attention_2

# Data training arguments
dataset_name: open-r1/codeforces_cots_w_openthoughts_decontaminated
dataset_config: solutions
dataset_num_proc: 48

# SFT trainer config
callbacks:
- push_to_hub_revision
benchmarks:
- lcb
bf16: true
do_eval: false
eval_strategy: 'no'
gradient_accumulation_steps: 8
gradient_checkpointing: true
gradient_checkpointing_kwargs:
use_reentrant: false
hub_model_id: open-r1/Qwen2.5-Coder-7B-Instruct-SFT
hub_model_revision: v06.00
hub_strategy: every_save
learning_rate: 1.0e-05
log_level: info
logging_steps: 1
logging_strategy: steps
lr_scheduler_type: cosine_with_min_lr
lr_scheduler_kwargs:
min_lr_rate: 0.1
packing: true
max_length: 32768
max_steps: -1
num_train_epochs: 10
output_dir: data/open-r1/Qwen2.5-Coder-7B-Instruct-SFT-v06.00
overwrite_output_dir: true
per_device_eval_batch_size: 1
per_device_train_batch_size: 2
push_to_hub: true
report_to:
- wandb
save_strategy: epoch
save_total_limit: 1
seed: 42
use_liger: true
wandb_entity: huggingface
wandb_project: open-r1
warmup_ratio: 0.03

[8]ページ先頭

©2009-2025 Movatter.jp