SAPT

The official implementation for the ACL 2024 paper SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models.

Requirements

Python 3.10.12
PyTorch 2.1.0
Transformers 4.30.2
CUDA 12.2

Preparation

The train/dev/test data from SuperNI and Long Sequence Benchmark is placed in /CL_Benchmark.

And the generated pseudo data points are in /generated_data.

Training

First run gen_script_{benchmark}_{model}.py to obtain the training script.

For example, to implement T5 model on the SuperNI benchmark:

python gen_script_superni_t5.py

Then run the resulting script to start the training process.

Evaluation

To calculate metrics of Average Performance (AP), Forgetting Rate (F.Ra), Forward Transfer (FWT) and Backward Transfer (BWT):

python score.py your_result_path single_result_path

Citation

If you find our work useful for your research, please kindly cite our paper as follows:

@inproceedings{zhao2024sapt,
  title={Sapt: A shared attention framework for parameter-efficient continual learning of large language models},
  author={Zhao, Weixiang and Wang, Shilong and Hu, Yulin and Zhao, Yanyan and Qin, Bing and Zhang, Xuanyu and Yang, Qing and Xu, Dongliang and Che, Wanxiang},
  booktitle={Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
  pages={11641--11661},
  year={2024}
}

Credits

The code of this repository partly relies on O-LoRA and I would like to show my sincere gratitude to authors of it.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
CL_Benchmark		CL_Benchmark
configs		configs
data		data
generated_data		generated_data
pseudo_data		pseudo_data
src		src
README.md		README.md
gen_script_long_llama.py		gen_script_long_llama.py
gen_script_long_t5.py		gen_script_long_t5.py
gen_script_superni_llama.py		gen_script_superni_llama.py
gen_script_superni_t5.py		gen_script_superni_t5.py
requirements.txt		requirements.txt
score.py		score.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAPT

Requirements

Preparation

Training

Evaluation

Citation

Credits

About

Releases

Packages

Languages

circle-hit/SAPT

Folders and files

Latest commit

History

Repository files navigation

SAPT

Requirements

Preparation

Training

Evaluation

Citation

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages