Loving the model and the thought you put into it <3. Have you considered requantizing?
#1 by khushman - opened
I mean requantizing after the llama.cpp update to the quantization format; it has made responses noticeably faster.
If not, would you mind sharing the fp16/fp32 weights here so I could quantize them myself?
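For reference, this is the usual llama.cpp flow I'd run if the full-precision weights were available. A minimal sketch only: the model path, output filenames, and quant type are placeholders, and the exact script/binary names depend on which llama.cpp version you have checked out.

```bash
# Assumes a recent llama.cpp checkout, already built, run from its root.
# 1) Convert the HF fp16 weights to GGUF (path is a placeholder):
python convert_hf_to_gguf.py /path/to/model --outtype f16 --outfile model-f16.gguf

# 2) Requantize with the current format (Q4_K_M is just an example type):
./llama-quantize model-f16.gguf model-Q4_K_M.gguf Q4_K_M
```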
Agreed, at least on sharing the original weights.