This repository demonstrates how to fine-tune the Google Gemma 2 2B model to improve its performance on Japanese instruction-following tasks. It serves as a practical guide for developers and researchers interested in adapting large language models for specific languages or domains using state-of-the-art techniques in 2024.
It uses the Hugging Face ecosystem, including the `transformers`, `datasets`, and `trl` libraries, to fine-tune the model efficiently with the QLoRA (Quantized Low-Rank Adaptation) technique.
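As a rough illustration of what this setup looks like in code (not the notebook's exact code: the model id, quantization settings, and LoRA hyperparameters below are assumptions), a QLoRA-style load of Gemma 2 2B in 4-bit might look like this:

```python
# Minimal QLoRA setup sketch; values are illustrative, not the notebook's exact config.
# Install (assumed): pip install transformers datasets trl peft accelerate bitsandbytes
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig

model_id = "google/gemma-2-2b"  # assumed base checkpoint; the notebook may use the instruction-tuned variant

# 4-bit NF4 quantization (the "Q" in QLoRA) shrinks the frozen base weights
# so the 2B model fits comfortably on a single GPU.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Low-rank adapters are the only weights that get trained.
lora_config = LoraConfig(
    r=16,  # illustrative rank
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```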
Key features:
- Fine-tuning Google Gemma 2 2B model for Japanese language tasks
- Utilization of QLoRA for efficient fine-tuning
- Dataset preparation and formatting for instruction tuning (see the formatting sketch after this list)
- Integration with Hugging Face's `transformers` and `trl` libraries
- Model evaluation and inference examples
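To make the instruction-formatting point concrete, here is a hedged sketch of mapping a question/answer pair into Gemma's user/model chat format. The column names `question` and `answer` and the use of the `-it` tokenizer are assumptions; adapt them to your dataset's actual schema.

```python
from transformers import AutoTokenizer

# The instruction-tuned checkpoint ships a chat template with "user"/"model" turns.
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b-it")

def format_example(example):
    # Column names are assumed; check your dataset's actual schema.
    messages = [
        {"role": "user", "content": example["question"]},
        {"role": "model", "content": example["answer"]},
    ]
    # Render the turns into a single training string, e.g.
    # <start_of_turn>user ... <end_of_turn><start_of_turn>model ... <end_of_turn>
    example["text"] = tokenizer.apply_chat_template(messages, tokenize=False)
    return example
```

Mapping this over a `datasets.Dataset` (e.g. `dataset.map(format_example)`) yields a `text` column that the trainer can consume directly.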
The project relies on the following libraries:
- PyTorch
- Transformers
- Datasets
- TRL (Transformer Reinforcement Learning)
- Accelerate
- PEFT (Parameter-Efficient Fine-Tuning)
- BitsAndBytes
To run the tutorial:
1. Prepare your dataset:
   - The notebook uses the "Mustain/JapaneseQADataset" from Hugging Face, but you can replace it with your own dataset.
   - Ensure your dataset is in the correct format (conversation or instruction format).
2. Set up your environment (steps 1 and 2 are sketched in code after this list):
   - Make sure you have access to a GPU for faster training.
   - Set your Hugging Face token for accessing the Gemma model.
3. Run the notebook:
   - Follow the steps in the notebook to load the model, prepare the dataset, and start the fine-tuning process.
4. Evaluate the model:
   - Use the provided evaluation code to test your fine-tuned model on new Japanese instructions.
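A hedged sketch of the environment and dataset steps; only the dataset name comes from the notebook, while the token handling and split name are assumptions.

```python
import torch
from datasets import load_dataset
from huggingface_hub import login

# Gemma weights are gated on the Hugging Face Hub, so authenticate first
# (alternatively, set the HF_TOKEN environment variable).
login(token="hf_...")  # placeholder token

# Fine-tuning a 2B model is impractical without a GPU.
assert torch.cuda.is_available(), "No GPU detected"

# The dataset used in the notebook; replace with your own instruction-format data.
dataset = load_dataset("Mustain/JapaneseQADataset", split="train")  # split name assumed
print(dataset[0])  # inspect one record before formatting it for training
```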
The fine-tuning setup used in the notebook:
- Model: Google Gemma 2 2B
- Fine-tuning Method: QLoRA (Quantized Low-Rank Adaptation)
- Training Framework: TRL's SFTTrainer
- Dataset: Japanese Q&A dataset (customizable)
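Wiring these pieces together might look roughly like the following, reusing the `model`, `lora_config`, and formatted `dataset` from the sketches above. Every hyperparameter here is illustrative, and keyword names can vary slightly across `trl` releases.

```python
from trl import SFTConfig, SFTTrainer

# Illustrative training configuration; tune these for your data and hardware.
training_args = SFTConfig(
    output_dir="gemma2-2b-jp-qlora",  # assumed output path
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,
    learning_rate=2e-4,
    num_train_epochs=1,
    logging_steps=10,
    dataset_text_field="text",  # column produced by the formatting step
)

trainer = SFTTrainer(
    model=model,              # quantized Gemma 2 2B from the earlier sketch
    args=training_args,
    train_dataset=dataset,    # dataset with a formatted "text" column
    peft_config=lora_config,  # LoRA adapter settings from the earlier sketch
)

trainer.train()
trainer.save_model()  # writes the LoRA adapter to output_dir
```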
The notebook demonstrates how the fine-tuned model improves at following Japanese instructions compared to the base model; specific results will vary with your dataset and training parameters.
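For inference, a hedged sketch of loading the saved adapter on top of the base model and prompting it with a Japanese instruction; the adapter path is an assumption, and the prompt format should match however the training data was formatted.

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "google/gemma-2-2b"       # assumed base checkpoint
adapter_dir = "gemma2-2b-jp-qlora"  # assumed adapter output directory

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, adapter_dir)

# Prompt in Gemma's turn format ("What is the capital of Japan?").
prompt = "<start_of_turn>user\n日本の首都はどこですか？<end_of_turn>\n<start_of_turn>model\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)

# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```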
You can easily adapt this notebook for other languages or specific domains by:
- Changing the base model (e.g., to Gemma 2 9B or other models)
- Using a different dataset relevant to your task
- Adjusting hyperparameters in the `TrainingArguments` and `LoraConfig` (illustrated in the sketch below)
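For example, a hypothetical adaptation might swap in a larger base model and a heavier LoRA configuration; all names and values below are illustrative assumptions.

```python
from peft import LoraConfig

model_id = "google/gemma-2-9b"             # larger base model; needs more GPU memory
dataset_id = "your-username/your-dataset"  # hypothetical dataset id in instruction format

# A heavier adapter: a higher rank trains more parameters and may help on
# domain-specific data, at the cost of memory and training time.
lora_config = LoraConfig(
    r=32,
    lora_alpha=64,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```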
This project is licensed under the MIT License - see the LICENSE file for details.