The performance of full-parameter finetuning

#13
by stephenshuang - opened

Will full-parameter fine-tuning perform better than LoRA-style fine-tuning?

Alibaba-NLP org

Our experiments show that full-parameter fine-tuning tends to yield better results than LoRA-style fine-tuning on in-domain test sets, but it suffers a significant performance drop on out-of-domain datasets. We therefore recommend the LoRA training approach, which not only conserves training resources but also preserves more of the base model's capabilities, such as long-text handling and multilingual tasks.
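
For reference, here is a minimal sketch of what LoRA-style fine-tuning looks like in practice, assuming the Hugging Face `peft` library. The base model name and the hyperparameters (rank, alpha, target modules) below are illustrative placeholders, not the settings used in our experiments.

```python
# Minimal LoRA fine-tuning setup with peft (illustrative, not our exact recipe).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, TaskType

base_model_name = "Qwen/Qwen1.5-0.5B"  # placeholder base model for illustration

tokenizer = AutoTokenizer.from_pretrained(base_model_name)
model = AutoModelForCausalLM.from_pretrained(base_model_name)

# LoRA adds low-rank adapters to selected weight matrices, so only a small
# fraction of parameters is trained while the base weights stay frozen.
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=16,                                  # adapter rank (illustrative)
    lora_alpha=32,                         # scaling factor (illustrative)
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # attention projections; adjust per architecture
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # prints the small trainable fraction vs. total params
```

Because only the adapter weights are updated, the frozen base weights retain capabilities such as long-context and multilingual handling, which is the trade-off motivating the recommendation above.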
