Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Tool refactor#1086

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Draft
hamishivi wants to merge58 commits intomain
base:main
Choose a base branch
Loading
fromtool-refactor
Draft
Show file tree
Hide file tree
Changes from1 commit
Commits
Show all changes
58 commits
Select commitHold shift + click to select a range
efa3b1d
fixes for tool use
hamishiviOct 16, 2025
e95e9c7
whoops, fix
hamishiviOct 16, 2025
14690cd
simpler approach
hamishiviOct 16, 2025
c9ea292
lint
hamishiviOct 16, 2025
d1ada94
Refactor tool architecture to use Ray actors
hamishiviOct 16, 2025
402dfb1
some changes away from mcp
hamishiviOct 16, 2025
70ccf31
Merge branch 'main' into tool-refactor
hamishiviOct 16, 2025
c5bfff9
fix test
hamishiviOct 16, 2025
ef89152
align tool vllm and llm ray actor
hamishiviOct 16, 2025
4bfd702
tool vllm demo
hamishiviOct 16, 2025
eeead3d
lint
hamishiviOct 17, 2025
07023fe
fix maybe
hamishiviOct 17, 2025
7041e23
update tool vllm
hamishiviOct 17, 2025
185cbd3
forgot callable
hamishiviOct 17, 2025
c2237ad
tool vllm now working
hamishiviOct 17, 2025
2725eef
lint
hamishiviOct 17, 2025
5264183
cleanly shut down vllm
hamishiviOct 17, 2025
005d697
Refactor: Reorganize tools code into cleaner folder structure
hamishiviOct 17, 2025
25041fe
cleaning up imports, slightly moving stuff around
hamishiviOct 17, 2025
f8e1acb
More cleaning up + readme
hamishiviOct 17, 2025
2aed0fa
Minor fixes
hamishiviOct 17, 2025
d312de0
Don't overlap query strings
hamishiviOct 17, 2025
d734adc
Add system prompt and example system prompt.
hamishiviOct 17, 2025
8c99b8f
lint
hamishiviOct 17, 2025
1ecfac9
new debug system prompt + debug script
hamishiviOct 17, 2025
c5ce620
exclude git from ray
hamishiviOct 17, 2025
1c3e17c
script fix
hamishiviOct 17, 2025
4f5e4d0
Update accelerate to unpin deepspeed
hamishiviOct 17, 2025
64c8caa
lint
hamishiviOct 17, 2025
335c7cb
upper bound deepspeed
Oct 17, 2025
acc9165
Merge branch 'main' into tool-refactor
hamishiviOct 20, 2025
93502be
fix import
hamishiviOct 20, 2025
4d917fa
Merge branch 'main' into tool-refactor
hamishiviOct 20, 2025
33b5c36
merge branch 'main' into tool-refactor
hamishiviOct 20, 2025
44a4570
fix
hamishiviOct 20, 2025
3058e22
more tooooools
hamishiviOct 20, 2025
9de4e7c
fix system prompt
hamishiviOct 21, 2025
ffd059c
yay fix tools
hamishiviOct 21, 2025
f44df1b
Truncate webpage
hamishiviOct 21, 2025
5941e13
more normal outputs
hamishiviOct 21, 2025
873a6fd
Restructure crawl.
hamishiviOct 21, 2025
660c062
make sure browse tool is reasonable
hamishiviOct 22, 2025
ea8127f
fix serper tool
hamishiviOct 22, 2025
7fdfe2d
optional api endpoint
hamishiviOct 22, 2025
50a33f9
fix str behaviour
hamishiviOct 22, 2025
f74c090
more fixes...
hamishiviOct 22, 2025
bddf9f8
add some logging
hamishiviOct 22, 2025
7a38e69
Merge branch 'main' into tool-refactor
hamishiviOct 24, 2025
d4fce78
duplicated sys prompt override
hamishiviOct 24, 2025
dd04f47
clean
hamishiviOct 24, 2025
c09aeaf
deepspeed broke on me :(
hamishiviOct 24, 2025
b23c83c
Merge branch 'main' into tool-refactor
hamishiviOct 31, 2025
c7c3941
vibe cobe some restructuring
hamishiviNov 3, 2025
822ede3
unsafe serialization
hamishiviNov 3, 2025
a1d4ae2
fix
hamishiviNov 3, 2025
6b4b6aa
tool
hamishiviNov 3, 2025
9d5245b
fix
hamishiviNov 3, 2025
d1b20d5
fix
hamishiviNov 3, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
PrevPrevious commit
NextNext commit
lint
  • Loading branch information
@hamishivi
hamishivi committedOct 17, 2025
commit8c99b8f9c6f764bdf9f83a8c1411914bf719f102
7 changes: 2 additions & 5 deletionsopen_instruct/dataset_transformation.py
View file
Open in desktop
Original file line numberDiff line numberDiff line change
Expand Up@@ -1235,6 +1235,7 @@ def rlvr_tokenize_v2(
row.pop(sft_messages_key)
return row


def rlvr_tokenize_v3(
row: Dict[str, Any],
tokenizer: PreTrainedTokenizer,
Expand DownExpand Up@@ -1296,11 +1297,7 @@ def rlvr_filter_v1(
return max_prompt_token_length_ok and max_token_length_ok and (contain_some_labels or not need_contain_labels)


def rlvr_filter_v2(
row: Dict[str, Any],
tokenizer: PreTrainedTokenizer,
max_prompt_token_length: Optional[int] = None,
):
def rlvr_filter_v2(row: Dict[str, Any], tokenizer: PreTrainedTokenizer, max_prompt_token_length: Optional[int] = None):
max_prompt_token_length_ok = True
if max_prompt_token_length is not None:
max_prompt_token_length_ok = len(row[INPUT_IDS_PROMPT_KEY]) <= max_prompt_token_length
Expand Down
5 changes: 1 addition & 4 deletionsopen_instruct/grpo_fast.py
View file
Open in desktop
Original file line numberDiff line numberDiff line change
Expand Up@@ -48,7 +48,6 @@

# isort: on
import asyncio
import copy
import json
import logging
import math
Expand DownExpand Up@@ -2174,9 +2173,7 @@ def setup_datasets(args: Args, tc: TokenizerConfig, tokenizer: PreTrainedTokeniz

"""Set up training and evaluation datasets."""
transform_fn_args = [
{
"system_prompt_override": system_prompt_override,
},
{"system_prompt_override": system_prompt_override},
{"max_prompt_token_length": args.max_prompt_token_length},
]
train_dataset = get_cached_dataset_tulu(
Expand Down

[8]ページ先頭

©2009-2025 Movatter.jp