nvidia/mamba2-8b-3t-4k
Tags: Text Generation, English, Megatron-LM, nvidia, Mamba, Mamba-2, SSM, 8B
arXiv: 2406.07887, 2405.21060
License: apache-2.0
mamba2-8b-3t-4k/latest_checkpointed_iteration.txt (main)
Commit History
Upload model (b915550), committed by rwaleffe on Jun 13