ryan-air/Alpaca-350M-Fine-Tuned
Professional work-related project
In this project, I have provided code and a Colaboratory notebook that facilitate the fine-tuning of an Alpaca model originally developed at Stanford University. The particular model being fine-tuned has around 350 million parameters, making it one of the smaller Alpaca models (smaller than my previous fine-tuned model).
The model uses low-rank adaptation (LoRA) to run with fewer computational resources and trainable parameters. We use bitsandbytes to load and run the model in an 8-bit format so that it fits on Colaboratory. Furthermore, the PEFT library from Hugging Face was used for fine-tuning the model.
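As a rough sketch of how this setup comes together (the base checkpoint path below is a placeholder, not the exact one from the notebook):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import prepare_model_for_int8_training

# Placeholder: substitute the actual ~350M Alpaca base checkpoint here.
BASE_MODEL = "path/to/alpaca-350m-base"

model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    load_in_8bit=True,   # bitsandbytes quantizes the weights to 8-bit at load time
    device_map="auto",   # place layers on whatever GPU Colaboratory provides
)
tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)

# Freezes the 8-bit base weights and casts layer norms to fp32 for training stability.
model = prepare_model_for_int8_training(model)
```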
Hyperparameters (see the configuration sketch after this list):
- MICRO_BATCH_SIZE = 4 (4 works with a smaller GPU)
- BATCH_SIZE = 32
- GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
- EPOCHS = 2 (Stanford's Alpaca uses 3)
- LEARNING_RATE = 2e-5 (Stanford's Alpaca uses 2e-5)
- CUTOFF_LEN = 256 (Stanford's Alpaca uses 512, but 256 accounts for 96% of the data and runs far quicker)
- LORA_R = 4
- LORA_ALPHA = 16
- LORA_DROPOUT = 0.05
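A minimal sketch of how these values might be wired into a LoRA configuration and a Hugging Face Trainer, not the exact notebook code: `model` and `tokenizer` are the 8-bit objects loaded earlier, and `train_dataset` is an assumed instruction dataset already tokenized to CUTOFF_LEN.

```python
import transformers
from peft import LoraConfig, get_peft_model

MICRO_BATCH_SIZE = 4
BATCH_SIZE = 32
GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE  # 32 // 4 = 8
EPOCHS = 2
LEARNING_RATE = 2e-5
CUTOFF_LEN = 256  # applied when tokenizing the dataset, not passed to the Trainer
LORA_R = 4
LORA_ALPHA = 16
LORA_DROPOUT = 0.05

lora_config = LoraConfig(
    r=LORA_R,
    lora_alpha=LORA_ALPHA,
    lora_dropout=LORA_DROPOUT,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)  # only the small LoRA matrices are trained

trainer = transformers.Trainer(
    model=model,
    train_dataset=train_dataset,  # assumption: examples already tokenized to CUTOFF_LEN
    args=transformers.TrainingArguments(
        per_device_train_batch_size=MICRO_BATCH_SIZE,
        gradient_accumulation_steps=GRADIENT_ACCUMULATION_STEPS,
        num_train_epochs=EPOCHS,
        learning_rate=LEARNING_RATE,
        fp16=True,                      # mixed precision for the trainable LoRA weights
        output_dir="alpaca-350m-lora",
    ),
    data_collator=transformers.DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```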
Credit for Original Model: Qiyuan Ge
Fine-Tuned Model: RyanAir/Alpaca-350M-Fine-Tuned (HuggingFace)
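To try the result, the adapter can be pulled straight from the Hub with PEFT. This is a sketch that assumes the Hub repo stores LoRA adapter weights on top of the same base checkpoint used above:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_MODEL = "path/to/alpaca-350m-base"  # placeholder for the actual base checkpoint

base = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL, load_in_8bit=True, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)

# Attach the fine-tuned LoRA adapter published on the Hugging Face Hub.
model = PeftModel.from_pretrained(base, "RyanAir/Alpaca-350M-Fine-Tuned")

# Illustrative Alpaca-style prompt.
prompt = "### Instruction:\nExplain LoRA in one sentence.\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```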