LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • arXiv:2402.12354
Papers related to Large Language Models (LLMs) and NLP.
Note: HF TRL PR for the GCPO paper implementation: https://github.com/huggingface/trl/pull/2155