AkihikoWatanabe commented 2 weeks ago

https://aclanthology.org/2024.lrec-main.206/

AkihikoWatanabe commented 2 weeks ago

Low-Rank Adaptation (LoRA) is a widespread parameter-efficient fine-tuning algorithm for large-scale language models. It has been commonly accepted that LoRA mostly achieves promising results in single-task, low-resource settings, and struggles to handle multi-task instruction tuning scenarios. In this paper, we conduct a systematic study of LoRA on diverse tasks and rich resources with different learning capacities, examining its performance on seen tasks during training and its cross-task generalization on unseen tasks. Our findings challenge the prevalent assumption that the limited learning capacity will inevitably result in performance decline. In fact, our study reveals that when configured with an appropriate rank, LoRA can achieve remarkable performance in high-resource and multi-task scenarios, even comparable to that achieved through full fine-tuning. It turns out that the constrained learning capacity encourages LoRA to prioritize conforming to instruction requirements rather than memorizing specialized features of particular tasks or instances. This study reveals the underlying connection between learning capacity and generalization capabilities for robust parameter-efficient fine-tuning, highlighting a promising direction for the broader application of LoRA across various tasks and settings.

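Summary (by gpt-4o-mini)

LoRA is a parameter-efficient fine-tuning method for large language models, and this work takes on its performance in multi-task settings in particular. The study evaluates LoRA across diverse tasks and resource levels and shows that, with an appropriate rank setting, it can match full fine-tuning even in high-resource settings. It finds that the constraint on learning capacity improves LoRA's generalization, which suggests a direction for broadening LoRA's applicability.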

AkihikoWatanabe commented 2 weeks ago

The takeaway seems to be that making the LoRA rank extremely large (1024 or more) improves generalization to unseen tasks compared with tuning the full parameters.
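
To get a feel for what a rank of 1024 means in terms of capacity, here is a rough sketch that counts the trainable parameters LoRA adds to a single weight matrix and compares them with updating the full matrix. The function names and the 4096 x 4096 matrix size are assumptions made for illustration; they are not taken from the paper.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix.

    LoRA learns the update as B @ A, with B of shape (d_out, rank)
    and A of shape (rank, d_in), while the original weight stays frozen.
    """
    return rank * (d_in + d_out)


def full_params(d_in: int, d_out: int) -> int:
    """Parameters updated when the whole weight matrix is fine-tuned."""
    return d_in * d_out


# Assumption: a square 4096 x 4096 projection, chosen only for illustration.
D = 4096
for r in (8, 64, 1024):
    ratio = lora_trainable_params(D, D, r) / full_params(D, D)
    print(f"rank={r:5d}  trainable fraction of full matrix = {ratio:.3f}")
```

For a square matrix the fraction is simply 2r/d, so under this assumed size a rank of 1024 trains half as many parameters as the full matrix, whereas a rank of 8 trains well under 1%.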

AkihikoWatanabe commented 2 weeks ago

AkihikoWatanabe / paper_notes

Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning, Xin+, LREC-COLING'24 #1475

See also 1474.