Edit model card

About

Quantized models collected from various sources for ease of use.

Sources

Some models have been quantized by me. And others:

Disclaimer

These models are provided "as-is" without any warranty. The respective licenses apply to each model, and it is the user's responsibility to comply with the terms of these licenses.

Downloads last month
2,742
GGUF
Model size
135M params
Architecture
llama

2-bit

4-bit

5-bit

8-bit

16-bit

Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.