

Editing large language models within 10 seconds


One-Sentence Summary

This repo aims to assist developers in injecting fresh and customized knowledge into large language models efficiently, using a single command.

Supported Models

  • GPT-J (e.g. EleutherAI/gpt-j-6b)
  • LLaMA and LLaMA-based models (e.g. Ziya-LLaMA-13B-v1)

Implemented Algorithms

  • Rank-One Model Editing (ROME)
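For background, ROME treats a chosen MLP weight matrix W as a linear key-value memory and applies a closed-form rank-one update. The formula below restates the method from Meng et al.'s paper, not code in this repo:

\hat{W} = W + \frac{(v_* - W k_*)\,(C^{-1} k_*)^{\top}}{(C^{-1} k_*)^{\top} k_*}, \qquad C = K K^{\top}

Here k_* is the key vector representing the subject, v_* is the value vector encoding the new fact, and C is the second moment of keys estimated over a large corpus; the update enforces \hat{W} k_* = v_* while perturbing responses to other keys as little as possible.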

Requirements

  • Python 3.8+ and PyTorch 1.13.1+
  • 🤗 Transformers, Datasets and Accelerate
  • sentencepiece and fire

Hardware Requirements

Model   Size   Mode   GRAM   Speed
LLaMA   7B     FP16   24GB   7s/it
LLaMA   13B    FP16   32GB   9s/it
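As a rough sanity check on this table (an illustrative back-of-the-envelope sketch, not part of this repo), FP16 weights cost 2 bytes per parameter; the rest of the GRAM budget covers activations and the editing algorithm's intermediate buffers:

def fp16_weight_gib(n_params: float) -> float:
    """Raw FP16 weight footprint in GiB: 2 bytes per parameter."""
    return n_params * 2 / 1024**3

# LLaMA-7B: ~13 GiB of weights within the 24 GB budget above.
print(f"{fp16_weight_gib(7e9):.1f} GiB")   # ~13.0
# LLaMA-13B: ~24 GiB of weights within the 32 GB budget above.
print(f"{fp16_weight_gib(13e9):.1f} GiB")  # ~24.2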

Getting Started

Data Preparation

For example, if we want to insert the factual knowledge "The prime minister of the UK is Rishi Sunak" into an LLM, we need to prepare a JSON file in a format similar to the following.

[  {"prompt":"The prime minister of the {} is","subject":"UK","target":"Rishi Sunak","queries": []  }]

In this format, the "prompt" field is a natural-language template in which "{}" stands for the subject, which is given in the "subject" field. The "target" field contains the updated content, which differs from the original model prediction. The "queries" field is optional: it is used for evaluating generalizability and is not used in training.
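If you prefer to build the data file programmatically, the snippet below writes a file in the same format (a minimal sketch; the output path data/my_edit.json is a placeholder):

import json

# One editing sample in the format described above: "{}" in "prompt" is the
# placeholder for "subject", and "target" is the new fact to inject.
samples = [
    {
        "prompt": "The prime minister of the {} is",
        "subject": "UK",
        "target": "Rishi Sunak",
        "queries": [],  # optional paraphrases for evaluating generalizability
    }
]

with open("data/my_edit.json", "w", encoding="utf-8") as f:
    json.dump(samples, f, ensure_ascii=False, indent=2)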

Installation

git clone https://github.com/hiyouga/FastEdit.git
conda create -n fastedit python=3.10
conda activate fastedit
cd FastEdit
pip install -r requirements.txt

Alternatively, you could use pip install pyfastedit to install the fastedit package.

Model Editing

CUDA_VISIBLE_DEVICES=0 python -m fastedit.editor \
    --data data/example.json \
    --model EleutherAI/gpt-j-6b \
    --config gpt-j-6b \
    --template default
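To apply edits from several data files in one go, you can wrap the CLI above in a small script (a sketch; the second data file is a placeholder, and only the documented command-line flags are used):

import os
import subprocess

# Placeholder list of edit files, each applied with the CLI shown above.
data_files = ["data/example.json", "data/more_edits.json"]

env = {**os.environ, "CUDA_VISIBLE_DEVICES": "0"}
for data in data_files:
    subprocess.run(
        ["python", "-m", "fastedit.editor",
         "--data", data,
         "--model", "EleutherAI/gpt-j-6b",
         "--config", "gpt-j-6b",
         "--template", "default"],
        check=True,
        env=env,
    )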

Editing LLMs: A Case

We use the samples in data/example.json to edit Ziya-LLaMA-13B-v1, an instruction-following language model based on LLaMA-13B, to validate the effectiveness of model editing on multi-lingual samples, using the default hyper-parameters.

Here are the generation results of the pre-edited model and the post-edited model, where the pre-edited results contain obsolete factual knowledge and the post-edited results maintain fresh factual knowledge.

// pre-edit
The prime minister of the United Kingdom is Boris Johnson.

// post-edit
The prime minister of the United Kingdom is Rishi Sunak.

// pre-edit
The name of prime minister of the UK is Boris Johnson.

// post-edit
The name of prime minister of the UK is Rishi Sunak.

// pre-edit
日本的首相叫作现任日本首相是菅义伟(Suga Yoshihide)。
(English: "The prime minister of Japan is called — The current prime minister of Japan is Suga Yoshihide.")

// post-edit
日本的首相叫作岸田文雄
(English: "The prime minister of Japan is called Kishida Fumio.")

// pre-edit
日本首相名字是现任日本首相的名字是菅义伟(Suga Yoshihide)。
(English: "The name of the prime minister of Japan is — The current prime minister of Japan's name is Suga Yoshihide.")

// post-edit
日本首相名字是岸田文雄
(English: "The name of the prime minister of Japan is Kishida Fumio.")

You can run the following command to reproduce the above results.

CUDA_VISIBLE_DEVICES=0 python -m fastedit.editor \
    --data data/example.json \
    --model path_to_your_ziya_13b_model \
    --config llama-13b \
    --template ziya
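To spot-check a checkpoint yourself, independent of FastEdit's own output, you can query it with plain 🤗 Transformers (a generic sketch; path_to_your_ziya_13b_model is the same placeholder as in the command above, and it assumes the edited weights have been saved to that path):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path_to_your_ziya_13b_model"  # placeholder, as in the command above
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.float16, device_map="auto"
)

# Greedy decoding keeps the pre-edit vs. post-edit comparison deterministic.
prompt = "The prime minister of the UK is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=16, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))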

TODO

  • Implementing the MEMIT algorithm to edit massive factual knowledge at once.
  • Leveraging an NER model to automatically identify subjects and targets from the texts.
  • Exploring how to effectively edit instruction-following models without performance degradation.

License

This repository is licensed under the Apache-2.0 License.

Citation

If this work is helpful, please kindly cite as:

@Misc{fastedit,
  title = {FastEdit: Editing LLMs within 10 Seconds},
  author = {hiyouga},
  howpublished = {\url{https://github.com/hiyouga/FastEdit}},
  year = {2023}
}

Acknowledgement

The current codebase of this repo largely benefits from Meng et al.'s ROME implementation. Thanks for their wonderful work.

Related Repos


