Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
neuralmagic
/
Meta-Llama-3.1-70B-Instruct-quantized.w4a16
like
25
Follow
Neural Magic
162
Text Generation
Transformers
Safetensors
8 languages
llama
int4
vllm
conversational
text-generation-inference
Inference Endpoints
4-bit precision
gptq
arxiv:
2210.17323
License:
llama3.1
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Commit History
Update README.md
b5b7bd4
verified
alexmarques
commited on
30 days ago
Update README.md
182abc2
verified
alexmarques
commited on
30 days ago
Update README.md
172ead4
verified
alexmarques
commited on
30 days ago
Update README.md
2e832eb
verified
alexmarques
commited on
30 days ago
Update README.md
7389db8
verified
alexmarques
commited on
30 days ago
Update README.md
3f1f2d2
verified
alexmarques
commited on
30 days ago
Update README.md
7e47b04
verified
alexmarques
commited on
30 days ago
Update README.md
2465bef
verified
alexmarques
commited on
30 days ago
Update README.md
3ecb94d
verified
alexmarques
commited on
30 days ago
Update README.md
b8c4487
verified
alexmarques
commited on
Oct 1
Upload tokenizer.json with huggingface_hub
5ce1373
verified
alexmarques
commited on
Sep 30
Update README.md
c639d39
verified
alexmarques
commited on
Sep 30
Upload tokenizer_config.json with huggingface_hub
7514275
verified
alexmarques
commited on
Sep 27
Update README.md
8c670bc
verified
alexmarques
commited on
Aug 13
Update README.md
14dfb3c
verified
abhinavnmagic
commited on
Aug 8
Update README.md
dfb3652
verified
abhinavnmagic
commited on
Aug 1
Create README.md
aae93a4
verified
abhinavnmagic
commited on
Jul 31
Upload folder using huggingface_hub
b74ae47
verified
abhinavnmagic
commited on
Jul 31
initial commit
d710f68
verified
abhinavnmagic
commited on
Jul 31