Monor's picture
Create README.md
158552c verified
metadata
license: apache-2.0

Introduce

Quantizing the nvidia/Llama3-ChatQA-1.5-8B to f16, q2, q3, q4, q5, q6 and q8 with Llama.cpp.