Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study • Paper 2404.10719 • Published Apr 16
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning • Paper 2405.12130 • Published May 20
S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs • Paper 2405.20314 • Published May 30
Contextual Position Encoding: Learning to Count What's Important • Paper 2405.18719 • Published May 29