- Notifications
You must be signed in to change notification settings - Fork936
[Feature] support Assign token to update the content of a token#1570
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
base:main
Are you sure you want to change the base?
Uh oh!
There was an error while loading.Please reload this page.
Conversation
HuggingFaceDocBuilderDev commentedJul 12, 2024
The docs for this PR livehere. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
pritam-dey3 commentedAug 29, 2024
We really need this feature! Is it tracked anywhere else? |
oups sorry, yeah here is the good place will come back in a bit! |
Okay, so this PR works for assigning, but the issue is that for |
Related to#1437 that does not work for unigram as if the token is in the vocabulary, even if it is special, it's still added to the unigram algorithm! |
jp1924 commentedJan 21, 2025 • edited
Loading Uh oh!
There was an error while loading.Please reload this page.
edited
Uh oh!
There was an error while loading.Please reload this page.
hello@ArthurZucker |
there is kind of a critical issue which is that a general solution is hard to find given how unigram works. Once backtracking BPE is merged i'll have another look |
Uh oh!
There was an error while loading.Please reload this page.
Very draft for now:
fixes#1473