Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mlx-community
/
Llama-3.1-Nemotron-70B-Instruct-HF-8bit
like
1
Follow
MLX Community
2,349
Text Generation
Transformers
Safetensors
MLX
nvidia/HelpSteer2
English
llama
nvidia
llama3.1
conversational
text-generation-inference
8-bit precision
License:
llama3.1
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
vLLM: Unknwon quantization method
#5 opened 23 days ago by
yaronr
Update README.md
#4 opened about 1 month ago by
manitonga
Upload folder using huggingface_hub
2
#1 opened about 1 month ago by
schroneko