Llama 3.1 8B Instruct models in GGUF format, which can be run on macOS, Windows, or Linux PCs, as well as on cell phones and smaller devices.

This repo focuses on small, capable LLMs that are easy to run for chatting with PDFs on macOS, balancing output quality against inference speed.

If you are a Mac user, you can download the beautiful ChatPDFLocal macOS app from here, load a single PDF or a batch of PDF files, and quickly try out the model by chatting with your documents.

PS. Click here to subscribe and you can use ChatPDFLocal for free.

The default local model is ggml-model-Q3_K_M.gguf. You can also load any customized open-source model that fits your Mac's hardware configuration by entering its Hugging Face repo.
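As a rough guide to which quantization might fit a given machine, the sketch below estimates the weight footprint of an 8.03B-parameter model at the nominal bit widths listed on this card and picks the largest one that fits a memory budget. The function names and the budget values are hypothetical, and the estimate is only a lower bound: real GGUF files are usually somewhat larger than the nominal figure, and inference needs extra memory for the context cache, so leave headroom.

```python
# Minimal sketch (not part of ChatPDFLocal): estimate model weight size
# from the parameter count and a nominal bit width.

PARAMS = 8.03e9                  # parameter count stated on this model card
NOMINAL_BITS = [2, 3, 4, 8, 16]  # bit widths listed on this model card


def estimate_gib(bits: int, params: float = PARAMS) -> float:
    """Lower-bound weight size in GiB: params * bits / 8 bytes, no overhead."""
    return params * bits / 8 / 2**30


def pick_bits(budget_gib: float, params: float = PARAMS):
    """Return the largest nominal bit width whose estimate fits, or None."""
    fitting = [b for b in NOMINAL_BITS if estimate_gib(b, params) <= budget_gib]
    return max(fitting) if fitting else None


if __name__ == "__main__":
    for bits in NOMINAL_BITS:
        print(f"{bits:>2}-bit: at least {estimate_gib(bits):.2f} GiB")
    # Hypothetical budget: 6 GiB of memory set aside for model weights.
    print("Largest nominal width for a 6 GiB budget:", pick_bits(6.0))
```

If no listed width fits the budget, `pick_bits` returns `None`, which suggests choosing a smaller model rather than a lower quantization.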

Enjoy, thank you!

Format: GGUF
Model size: 8.03B params
Architecture: llama
Quantizations: 2-bit, 3-bit, 4-bit, 8-bit, 16-bit
