Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
TheBloke
/
Llama-2-70B-Chat-AWQ
like
23
Text Generation
Transformers
Safetensors
PyTorch
English
llama
facebook
meta
llama-2
text-generation-inference
4-bit precision
awq
arxiv:
2307.09288
License:
llama2
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
55c2786
Llama-2-70B-Chat-AWQ
1 contributor
History:
22 commits
TheBloke
Update for Transformers AWQ support
55c2786
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
LICENSE.txt
Safe
7.02 kB
AWQ model commit
about 1 year ago
Notice
Safe
112 Bytes
AWQ model commit
about 1 year ago
README.md
Safe
20.9 kB
Update base_model formatting
about 1 year ago
USE_POLICY.md
Safe
4.77 kB
AWQ model commit
about 1 year ago
config.json
Safe
777 Bytes
Update for Transformers AWQ support
about 1 year ago
generation_config.json
Safe
188 Bytes
AWQ model commit
about 1 year ago
model-00001-of-00004.safetensors
Safe
9.94 GB
LFS
AWQ model commit
about 1 year ago
model-00002-of-00004.safetensors
Safe
9.9 GB
LFS
AWQ model commit
about 1 year ago
model-00003-of-00004.safetensors
Safe
9.9 GB
LFS
AWQ model commit
about 1 year ago
model-00004-of-00004.safetensors
Safe
6.87 GB
LFS
AWQ model commit
about 1 year ago
model.safetensors.index.json
Safe
159 kB
AWQ model commit
about 1 year ago
quant_config.json
Safe
90 Bytes
AWQ model commit
about 1 year ago
special_tokens_map.json
Safe
414 Bytes
AWQ model commit
about 1 year ago
tokenizer.json
Safe
1.84 MB
AWQ model commit
about 1 year ago
tokenizer.model
Safe
500 kB
LFS
AWQ model commit
about 1 year ago
tokenizer_config.json
Safe
776 Bytes
AWQ model commit
about 1 year ago