Article Hugging Face welcomes the Aya Expanse family of multilingual models By ariG23498 • about 1 month ago • 10
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19 • 135
Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention Aug 21 • 22
Understanding Reference Policies in Direct Preference Optimization Paper • 2407.13709 • Published Jul 18 • 16
Article RegMix: Data Mixture as Regression for Language Model Pre-training By SivilTaram • Jul 11 • 10
Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated 9 days ago • 13