Quantized models in GGUF
#4 · pinned · opened by MaziyarPanahi
Hi @v2ray
Thanks for converting and sharing this model, kudos! I am in the process of uploading GGUF models from 16-bit all the way down to 2-bit, for those with low resources:
https://huggingface.co/MaziyarPanahi/Mixtral-8x22B-v0.1-GGUF
owo
v2ray changed discussion status to closed
v2ray pinned discussion
v2ray changed discussion status to open
Hello all,
In LM Studio 0.2.18 I get "llama.cpp error: 'illegal split file: 4, model must be loaded with the first split'" for the Q4 model.
Any tips on how to solve this?
Thank you.
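For reference, that error message suggests the loader was pointed at a later shard instead of the first one. Under llama.cpp's split-GGUF convention (an assumption here, not confirmed in this thread), shards are named `...-00001-of-0000N.gguf` and the loader must be given the first shard, which then pulls in the rest. A minimal Python sketch for picking the first split out of a set of downloaded files:

```python
import re

def first_split(filenames):
    """Return the first shard of a split GGUF model (the file the loader
    expects to be pointed at), or None if no first shard is present.

    Assumes the llama.cpp naming convention: name-0000X-of-0000N.gguf
    """
    pattern = re.compile(r"-(\d{5})-of-(\d{5})\.gguf$")
    for name in filenames:
        m = pattern.search(name)
        if m and int(m.group(1)) == 1:
            return name
    return None
```

Passing the file returned here (rather than any later `-00002-...` or `-00004-...` shard) is what the "model must be loaded with the first split" message is asking for.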
Does LM Studio support this new split loading?
It does support it, I can confirm.