arxiv:2411.13676
Zijia Chen
zijiac-nvidia
AI & ML interests
None yet
Recent Activity
Authored a paper 5 days ago:
Hymba: A Hybrid-head Architecture for Small Language Models
Organizations
Papers (1)
Models
None public yet
Datasets
None public yet