Hymba: A Hybrid-head Architecture for Small Language Models (Paper • arXiv:2411.13676 • Published 7 days ago)