mlx-community/Llama-3.1-Nemotron-70B-Instruct-HF-8bit

Text Generation

text-generation-inference

8-bit precision

vLLM: Unknown quantization method

#5 opened 23 days ago by

Update README.md

#4 opened about 1 month ago by

Upload folder using huggingface_hub

#1 opened about 1 month ago by