Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Cited by: 3
Authors
Hu, Yahao [1 ]
Xie, Yifei [1 ]
Wang, Tianfeng [1 ]
Chen, Man [1 ]
Pan, Zhisong [1 ]
Affiliations
[1] College of Command and Control Engineering, Army Engineering University of PLA, Nanjing 210007, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
pre-trained language models; parameter-efficient fine-tuning; low-rank adaptation; intrinsic rank; training efficiency;
DOI
10.3390/math11204317
Chinese Library Classification
O1 [Mathematics];
Subject Classification Codes
0701; 070101;
Abstract
With the growing scale of pre-trained language models (PLMs), full-parameter fine-tuning becomes prohibitively expensive and practically infeasible. Therefore, parameter-efficient adaptation techniques for PLMs have been proposed, which learn through incremental updates to the pre-trained weights, as in low-rank adaptation (LoRA). However, LoRA relies on heuristics to select the modules and layers to which it is applied, and it assigns every incremental matrix the same rank. Consequently, fine-tuning that ignores the structural information across modules and layers is suboptimal. In this work, we propose structure-aware low-rank adaptation (SaLoRA), which adaptively learns the intrinsic rank of each incremental matrix by removing rank-0 components during training. We conduct comprehensive experiments using pre-trained models of different scales in both task-oriented (GLUE) and task-agnostic (Yelp and GYAFC) settings. The experimental results show that SaLoRA effectively captures the structure-aware intrinsic rank. Moreover, our method consistently outperforms LoRA without significantly compromising training efficiency.
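To make the mechanism in the abstract concrete, the sketch below shows one common way a rank-adaptive low-rank update can be realized: a frozen linear layer plus a gated update B·diag(g)·A, where driving a gate to zero removes one rank component. This is a minimal PyTorch sketch under our own assumptions, not the authors' implementation: the names GatedLoRALinear, prune_rank0, and gate_sparsity_penalty are hypothetical, and the L1 penalty on the gates stands in for whatever sparsity objective SaLoRA actually uses to identify rank-0 components.

```python
import torch
import torch.nn as nn


class GatedLoRALinear(nn.Module):
    """Frozen linear layer with a gated low-rank update (illustrative sketch).

    The incremental update is B @ diag(g) @ A; driving a gate g[i] to zero
    removes one rank component, so the effective (intrinsic) rank can differ
    per module instead of being fixed by hand, as in plain LoRA.
    """

    def __init__(self, in_features: int, out_features: int,
                 max_rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():  # pre-trained weights stay frozen
            p.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(max_rank, in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_features, max_rank))
        self.gate = nn.Parameter(torch.ones(max_rank))  # one gate per rank component
        self.scaling = alpha / max_rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # (batch, in) -> (batch, max_rank), gated per component, -> (batch, out)
        delta = ((x @ self.A.t()) * self.gate) @ self.B.t()
        return self.base(x) + self.scaling * delta

    @torch.no_grad()
    def prune_rank0(self, threshold: float = 1e-3) -> int:
        # Hard-zero gates that have effectively vanished during training and
        # report the remaining rank of this module's incremental matrix.
        self.gate[self.gate.abs() < threshold] = 0.0
        return int((self.gate != 0).sum())


def gate_sparsity_penalty(model: nn.Module, lam: float = 1e-4) -> torch.Tensor:
    # L1 surrogate pushing gates toward zero, i.e. toward rank-0 components;
    # added to the task loss so each matrix learns its own intrinsic rank.
    return lam * sum(m.gate.abs().sum()
                     for m in model.modules()
                     if isinstance(m, GatedLoRALinear))
```

In training, the penalty would be added to the task loss, and prune_rank0 called periodically so that different modules and layers end up with different ranks, matching the structure-aware behavior the abstract describes.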
Pages: 16