Hymba: A Hybrid-head Architecture for Small Language Models • Paper • 2411.13676 • Published 7 days ago • 35
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models • Paper • 2409.17481 • Published Sep 26 • 46