Model2Vec: Distill a Small Fast Model from any Sentence Transformer (by Pringled, Oct 14)
Improving Hugging Face Training Efficiency Through Packing with Flash Attention (Aug 21)