SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated about 23 hours ago • 81
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17 • 18
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21 • 53
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 • 47
WaveUI Collection WaveUI is a collection of datasets and tools to improve UI object detection • 6 items • Updated Jul 31 • 9