add FastSentenceTransformer for easily finetuning SentenceTransformer models #3719


Open

electroglyph wants to merge 20 commits into unslothai:main from electroglyph:FST

Conversation

@electroglyph commented Dec 12, 2025 (edited)

Supersedes #3718

Example training code:

from unsloth import FastSentenceTransformer
from sentence_transformers import (
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from datasets import Dataset
import torch

model_name = "Snowflake/snowflake-arctic-embed-m-v1.5"

# Load the base encoder with Unsloth's optimized loading (4-bit quantized).
model = FastSentenceTransformer.from_pretrained(
    model_name,
    load_in_4bit=True,
    device_map="cuda",
)

# Attach LoRA adapters to the encoder's attention and dense layers.
model = FastSentenceTransformer.get_peft_model(
    model,
    r=16,
    target_modules=["query", "key", "value", "dense"],
    lora_alpha=16,
    lora_dropout=0,
    bias="none",
    # task_type="FEATURE_EXTRACTION",
)

# A tiny similarity dataset in the (sentence_A, sentence_B, label) format.
train_dataset = Dataset.from_dict(
    {
        "sentence_A": [
            "The cat sits outside",
            "A man is playing guitar",
            "I love pasta",
        ],
        "sentence_B": [
            "A man is playing guitar",
            "The woman loves that cat",
            "Do you like pizza?",
        ],
        "label": [0.0, 0.5, 1.0],
    }
)

from sentence_transformers.losses import CoSENTLoss

loss = CoSENTLoss(model)

args = SentenceTransformerTrainingArguments(
    output_dir="test_trainer_output",
    num_train_epochs=1,
    per_device_train_batch_size=2,
    learning_rate=2e-4,
    fp16=not torch.cuda.is_bf16_supported(),
    bf16=torch.cuda.is_bf16_supported(),
    logging_steps=1,
    save_strategy="no",
    report_to="none",
)

trainer = SentenceTransformerTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    loss=loss,
)

trainer.train()
print("Training finished successfully!")
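Not part of the PR text, but since the result is a regular sentence_transformers.SentenceTransformer, inference after training should follow the standard API. A minimal sketch, assuming the trained wrapper exposes encode() and (on sentence-transformers >= 3.0) similarity():

# Hedged follow-up sketch, not from the PR: standard SentenceTransformer
# inference on the model trained above.
embeddings = model.encode(
    ["The cat sits outside", "A man is playing guitar"],
    normalize_embeddings=True,
)
print(embeddings.shape)  # (2, embedding_dim)

# similarity() assumes sentence-transformers >= 3.0; returns a score matrix.
scores = model.similarity(embeddings, embeddings)
print(scores)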

@gemini-code-assist (bot, Contributor) commented

Summary of Changes

Hello @electroglyph, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands Unsloth's capabilities by introducing FastSentenceTransformer, a new class designed to streamline the finetuning of SentenceTransformer models. By integrating Unsloth's optimized model loading and PEFT (LoRA) application, users can now efficiently train embedding models with a reduced memory footprint and improved speed, broadening the range of models that can benefit from Unsloth's performance enhancements.

Highlights

  • New FastSentenceTransformer class: introduces FastSentenceTransformer to enable easy and optimized finetuning of SentenceTransformer models within the Unsloth framework.
  • Optimized model loading and PEFT integration: leverages Unsloth's FastModel for efficient loading of base models and seamlessly integrates PEFT (LoRA) for memory-efficient finetuning of the underlying transformer.
  • Automatic pooling mode detection: includes logic to automatically detect the pooling mode (e.g., 'cls', 'mean', 'max') from the SentenceTransformer model's configuration, ensuring correct setup (see the sketch after this list).
  • Seamless SentenceTransformer wrapper: wraps the loaded base model into a sentence_transformers.SentenceTransformer instance, complete with Transformer, Pooling, and Normalize modules, making it compatible with the sentence-transformers ecosystem.
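The PR contains the actual implementation; purely as illustration, pooling detection and wrapper assembly along these lines would follow sentence-transformers conventions, where pooling settings live in 1_Pooling/config.json. A minimal sketch (detect_pooling_mode and wrap_as_sentence_transformer are hypothetical names, not the PR's API):

# Minimal sketch, not the PR's code: pooling-mode detection and wrapper
# assembly following sentence-transformers checkpoint conventions.
import json
import os

from sentence_transformers import SentenceTransformer, models

def detect_pooling_mode(model_dir: str) -> str:
    """Read the saved Pooling config and map it to 'cls', 'max', or 'mean'."""
    path = os.path.join(model_dir, "1_Pooling", "config.json")
    if not os.path.exists(path):
        return "mean"  # assumption: default to mean pooling when unspecified
    with open(path) as f:
        cfg = json.load(f)
    if cfg.get("pooling_mode_cls_token"):
        return "cls"
    if cfg.get("pooling_mode_max_tokens"):
        return "max"
    return "mean"

def wrap_as_sentence_transformer(model_dir: str) -> SentenceTransformer:
    """Build the Transformer -> Pooling -> Normalize pipeline described above."""
    word = models.Transformer(model_dir)
    pooling = models.Pooling(
        word.get_word_embedding_dimension(),
        pooling_mode=detect_pooling_mode(model_dir),
    )
    return SentenceTransformer(modules=[word, pooling, models.Normalize()])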

@electroglyph (Author) commented

Just getting this started; I'll get to the docs (and any suggestions) tomorrow.


@gemini-code-assist (bot) left a comment

Code Review

This pull request introduces FastSentenceTransformer, a new class for easily finetuning SentenceTransformer models with Unsloth's optimizations. The implementation is solid, providing from_pretrained and get_peft_model methods that correctly integrate with the existing FastModel framework. The code handles loading quantized models, applying PEFT, and constructing a SentenceTransformer object. My review includes a couple of suggestions to improve code style and maintainability, such as moving imports to the top level and refactoring a conditional block to be more concise.

@shimmyshimmer (Collaborator) commented

Thank you amazing!! Please let us know if you'd like to collab on a blog as well :)


@electroglyph (Author) commented Dec 13, 2025 (edited)

> Thank you amazing!! Please let us know if you'd like to collab on a blog as well :)

Absolutely, that would be great!

unslothai/unsloth-zoo#383 will add XLMRobertaModel support


@electroglyph (Author) commented

Here is a Colab notebook to test the current code: https://colab.research.google.com/drive/1Wu-lB33o8JdeKT1R38uGLbd0yCqluYML?usp=sharing

@Datta0 (Collaborator) commented

OK, I tested your notebook and it seems to work fine. I'll review it in the morning.

@Datta0 (Collaborator) left a comment

Great work! I have some queries and comments :)

electroglyph marked this pull request as ready for review December 16, 2025 11:08
@chatgpt-codex-connector (bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.


@electroglyph (Author) commented Dec 17, 2025 (edited)

Here's the current compatibility status: I tested training the top 100 encoder embedding models (by download count), and 72 out of 100 can be trained right now: https://0x0.st/PrxF.txt

I'm going to see if I can get some easy wins and increase that a bit.

After the mpnet patch, it's up to 76: https://0x0.st/PrYx.txt
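The harness behind these numbers isn't shown in the PR. For illustration only, a sweep like this could be scripted as below; the candidate list is a placeholder for the top-100 models, and run_tiny_training just mirrors the toy CoSENTLoss run from the PR description:

# Sketch of a compatibility sweep, not the author's actual harness.
import torch
from datasets import Dataset
from sentence_transformers import (
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from sentence_transformers.losses import CoSENTLoss
from unsloth import FastSentenceTransformer

# Toy similarity pairs reused for every candidate model.
toy = Dataset.from_dict({
    "sentence_A": ["The cat sits outside", "I love pasta"],
    "sentence_B": ["The woman loves that cat", "Do you like pizza?"],
    "label": [0.5, 1.0],
})

def run_tiny_training(model):
    """One short training run on the toy pairs, mirroring the PR example."""
    args = SentenceTransformerTrainingArguments(
        output_dir="sweep_output",
        num_train_epochs=1,
        per_device_train_batch_size=2,
        fp16=not torch.cuda.is_bf16_supported(),
        bf16=torch.cuda.is_bf16_supported(),
        save_strategy="no",
        report_to="none",
    )
    SentenceTransformerTrainer(
        model=model, args=args, train_dataset=toy, loss=CoSENTLoss(model)
    ).train()

candidates = [
    "Snowflake/snowflake-arctic-embed-m-v1.5",  # illustrative entries only
    "sentence-transformers/all-mpnet-base-v2",
    # ... remaining top-100 encoder models by downloads
]

results = {}
for name in candidates:
    try:
        model = FastSentenceTransformer.from_pretrained(name, load_in_4bit=True)
        run_tiny_training(model)
        results[name] = "trainable"
    except Exception as exc:
        results[name] = f"failed ({type(exc).__name__})"

for name, status in results.items():
    print(f"{name}: {status}")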


Reviewers

@Datta0, @gemini-code-assist[bot], and @chatgpt-codex-connector[bot] left review comments.

At least 1 approving review is required to merge this pull request.
