Movatterモバイル変換

thunlp/OpenDeltaPublic

NotificationsYou must be signed in to change notification settings
Fork83
Star1k

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

opendelta.readthedocs.io

License

Apache-2.0 license

1k stars 83 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
docs		docs
examples		examples
opendelta		opendelta
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Repository files navigation

An Open-Source Framework for Parameter-Efficient Tuning (Delta Tuning).

Overview •Installation •Basic Usage •Docs •Performance •

Overview

OpenDelta is a toolkit for parameter-efficient tuning methods (we dub it asdelta tuning), by which users could flexibly assign (or add) a small amount parameters to update while keeping the most parameters frozen. By using OpenDelta, users could easily implement prefix-tuning, adapters, Lora, or any other types of delta tuning with preferred PTMs.

The latest version of OpenDelta is tested on Python==3.8.13, PyTorch==1.12.1, transformers==4.22.2. Other versions are likely to be supported as well. If you encounter bugs when using your own package versions, please raise an issue, we will look into it as soon as possible.
A demo of using OpenDelta to modify the PLM (E.g., BART).

News

2022.10.25 Release v0.3.2. SupportBMTrain! Improve docs. Add inspect utilities.
2022.10.14 Release v0.3.0. We make the usage of default configurations of each delta tuning methods (i.e., the position they are attached) more friendly! If a custom model has our supported models as submodules inside, the default configuration is also available. Other key changes can be seen inUpdate Log
2022.10.10 Merge a long-developed branch v0.2.4 into the master branch. Key updates are (1) the an example unifying the delta tuning paradigm and the prompt-tuning paradigm; (2) and support forDelta Center, whose webpage is still under construction. Details can be seen inUpdate Log
2022.03.24 We notice several bugs in Soft Prompt Tuning and Prefix Tuning, mainly due to their need to customize attention ids, token_type_ids, we are fixing it! Currently, please use the other methods since they are stabler and better in performance.
2022.03.20 Add aColab example to illustrate efficient training and space-saving multitask-serving.
2022.03.20 A new pip version released.
2022.02.16 Supportregular expression in named-based addressing.

Installation

create a virtualenv (optional)

conda create -n opendelta_env python=3.8conda activate opendelta_env

install the latest version

pip install git+https://github.com/thunlp/OpenDelta.git

or install the latest pip version (more stable)

pip install opendelta

or build from source

git clone git@github.com:thunlp/OpenDelta.gitcd OpenDeltapython setup.py install# python setup.py develop # if you want to do some modifications on the code for your research:

Must Try

The following codes and comments walk you through the key functionality of OpenDelta. It is also inmust_try.py andmust_try.ipynb in Colab.

# use transformers as usual.fromtransformersimportAutoModelForSeq2SeqLM,AutoTokenizert5=AutoModelForSeq2SeqLM.from_pretrained("t5-large")t5_tokenizer=AutoTokenizer.from_pretrained("t5-large")# A running exampleinputs_ids=t5_tokenizer.encode("Is Harry Potter written by J.K. Rowling",return_tensors="pt")t5_tokenizer.decode(t5.generate(inputs_ids)[0])# >>> '<pad><extra_id_0>? Is it Harry Potter?</s>'# use existing delta modelsfromopendeltaimportAutoDeltaModel,AutoDeltaConfig# use existing delta models from DeltaCenterdelta=AutoDeltaModel.from_finetuned("thunlp/Spelling_Correction_T5_LRAdapter_demo",backbone_model=t5)# freeze the whole backbone model except the delta models.delta.freeze_module()# visualize the changedelta.log()t5_tokenizer.decode(t5.generate(inputs_ids)[0])# >>> <pad> Is Harry Potter written by J.K. Rowling?</s># Now save merely the delta models, not the whole backbone model, to tmp/delta.save_finetuned(".tmp")importos;os.listdir(".tmp")# >>>  The state dict size is 1.443 MB# >>>  We encourage users to push their final and public models to delta center to share them with the community!# reload the model from local url and add it to pre-trained T5.t5=AutoModelForSeq2SeqLM.from_pretrained("t5-large")delta1=AutoDeltaModel.from_finetuned(".tmp",backbone_model=t5)importshutil;shutil.rmtree(".tmp")# don't forget to remove the tmp files.t5_tokenizer.decode(t5.generate(inputs_ids)[0])# >>> <pad> Is Harry Potter written by J.K. Rowling?</s># detach the delta models, the model returns to the unmodified status.delta1.detach()t5_tokenizer.decode(t5.generate(inputs_ids)[0])# >>> '<pad><extra_id_0>? Is it Harry Potter?</s>'# use default configuration for customized wrapped models which have PLMs inside. This is a common need for users.importtorch.nnasnnclassWrappedModel(nn.Module):def__init__(self,inner_model):super().__init__()self.inner=inner_modeldefforward(self,*args,**kwargs):returnself.inner(*args,**kwargs)wrapped_model=WrappedModel(WrappedModel(t5))# say we use LoRAdelta_config=AutoDeltaConfig.from_dict({"delta_type":"lora"})delta2=AutoDeltaModel.from_config(delta_config,backbone_model=wrapped_model)delta2.log()# >>> root#       -- inner#          -- inner#             ...#             ... lora_A:[8,1024], lora_B:[1024,8]delta2.detach()# use a not default configuration# say we add lora to the last four layer of the decoder of t5, with lora rank=5delta_config3=AutoDeltaConfig.from_dict({"delta_type":"lora","modified_modules":["[r]decoder.*((20)|(21)|(22)|(23)).*DenseReluDense\.wi"],"lora_r":5})delta3=AutoDeltaModel.from_config(delta_config3,backbone_model=wrapped_model)delta3.log()

Verified Default Configurations

You can try to use OpenDelta onany backbone models based on PyTorch.
However, with small chances that the interface of the submodules of the backbone model is not supported. Therefore we verified some commonlyused models that OpenDelta are sure to support.
We will keep testing more and more emerging models.
Pull requests are welcomed when you successfully apply OpenDelta on your own backbone model.

Citation

@article{hu2023opendelta,title={OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models},author={Hu, Shengding and Ding, Ning and Zhao, Weilin and Lv, Xingtai and Zhang, Zhen and Liu, Zhiyuan and Sun, Maosong},journal={arXiv preprint arXiv:2307.03084},year={2023}}

@article{ding2022delta,title={Delta tuning: A comprehensive study of parameter efficient methods for pre-trained language models},author={Ding, Ning and Qin, Yujia and Yang, Guang and Wei, Fuchao and Yang, Zonghan and Su, Yusheng and Hu, Shengding and Chen, Yulin and Chan, Chi-Min and Chen, Weize and others},journal={arXiv preprint arXiv:2203.06904},year={2022}}

About

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

opendelta.readthedocs.io

Releases3

OpenDelta v0.3.2 Latest

Nov 21, 2022

+ 2 releases

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Overview

News

Installation

Must Try

Verified Default Configurations

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases3

Packages

Uh oh!

Contributors15

Uh oh!

Languages

Movatterモバイル変換

License

thunlp/OpenDelta

Folders and files

Latest commit

History

Repository files navigation

Overview

News

Installation

Must Try

Verified Default Configurations

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases3

Packages0

Uh oh!

Contributors15

Uh oh!

Languages

Packages