Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
grimjim
/
Mistral-Nemo-Instruct-2407-12B-6.4bpw-exl2
like
4
Text Generation
Transformers
Safetensors
9 languages
mistral
conversational
text-generation-inference
Inference Endpoints
exl2
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
d085286
Mistral-Nemo-Instruct-2407-12B-6.4bpw-exl2
/
params.json
grimjim
Initial release
ac7f299
5 months ago
raw
Copy download link
history
blame
Safe
204 Bytes
{
"dim"
:
5120
,
"n_layers"
:
40
,
"head_dim"
:
128
,
"hidden_dim"
:
14336
,
"n_heads"
:
32
,
"n_kv_heads"
:
8
,
"norm_eps"
:
1e-05
,
"vocab_size"
:
131072
,
"rope_theta"
:
1000000.0
}