Getting GGUF quantization

by hbacard - opened Feb 8

Feb 8

@TheBloke : I really like what you are doing ! I would like to know how you actually converted-quantized this model [which script from llama.cpp you used]. I am coming from academia and want to understand what's going on under the hood :).

Thanks a lot !

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment