Movatterモバイル変換

alaradirik/finetune-phi-2Public

NotificationsYou must be signed in to change notification settings
Fork3
Star4

Fine tune Phi 2 for persona grounded chat

License

MIT license

4 stars 3 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
LICENSE		LICENSE
README.md		README.md
finetune_phi.py		finetune_phi.py
requirements.txt		requirements.txt
test.py		test.py

Repository files navigation

Fine-tuning Phi 2 for Persona-Grounded Chat

This project is a tutorial on parameter-efficient fine-tuning (PEFT) and quantization of thePhi 2 model for persona-grounded chat. We use LoRA for PEFT and 4-bit quantization to compress the model, and fine-tune the model on a new persona-based chat dataset -nazlicanto/persona-based-chat. Refer to the dataset page and Hugging Face modelpage for additional details.

Usage

Start by cloning the repository, setting up a conda environment and installing the dependencies. We tested our scripts with python 3.9 and CUDA 11.7.

git clone https://github.com/alaradirik/finetune-phi-2.gitcd finetune-phi-2conda create -n llm python=3.9conda activate llmpip install -r requirements.txt

You can finetune the model on chatdataset or another dataset. Note that you will need to have the same features as our dataset and pass in your HF Hub token as an argument if using a private dataset. Fine-tuning takes about 9 hours on a single A40, you can either use the default accelerate settings or configure it to use multiple GPUs. To fine-tune the model:

accelerate config defaultpython finetune_phi.py --dataset=<HF_DATASET_ID_OR_PATH> --base_model="microsoft/phi-2" --model_name=<YOUR_MODEL_NAME> --auth_token=<HF_AUTH_TOKEN> --push_to_hub

One model training is completed, only the fine-tuned (LoRA) parameters are saved, which are loaded to overwrite the corresponding parameters of the base model during testing. To test the fine-tuned model with a random sample selected from the dataset, runpython test.py.

License

The code and trained model are licensed under theMIT license.

About

Fine tune Phi 2 for persona grounded chat

Releases

No releases published

Packages

No packages published

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning Phi 2 for Persona-Grounded Chat

Usage

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

Movatterモバイル変換

License

alaradirik/finetune-phi-2

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning Phi 2 for Persona-Grounded Chat

Usage

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Languages

Packages