Re-quant?

by BlueNipples - opened Apr 30

Discussion

BlueNipples

Apr 30

Just wondering whether you'll be re-quanting this now that the GGUF tokenization in llama.cpp is fixed?

Orenguteng

Owner Apr 30

You want a new GGUF quant in the GGUF repo, correct? I could re-upload that tonight.

concedo

Apr 30

Which is the newer one, this or the one labelled as V1?

Orenguteng

Owner Apr 30

@concedo The V1 is named "LexiFun"; it's something different. It is the first experimental version and will become better in the next one. This one, however, is the regular Llama3-8B.

BlueNipples

May 3

> You want a new GGUF quant in the gguf repo correct? I could re-upload that tonight

Yes. It's possible that the changed/fixed tokenization in the new llama.cpp breaks old GGUFs. There definitely appears to be something screwy going on when I try to run them.

concedo

May 3

For now, if anyone wants, I've created a PR with a few files re-quanted here:
https://huggingface.co/Orenguteng/Llama-3-8B-Lexi-Uncensored-GGUF/tree/refs%2Fpr%2F5
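For anyone who wants to pull a file straight from that PR ref rather than browsing the tree, here is a minimal sketch of how the direct-download URL can be built. It assumes the usual Hub `resolve` URL layout; `pr_file_url` is a hypothetical helper name and the filename in the usage line is a placeholder, not an actual file in the PR.

```python
from urllib.parse import quote

HF_ENDPOINT = "https://huggingface.co"


def pr_file_url(repo_id: str, pr_number: int, filename: str) -> str:
    """Build a direct-download URL for a file on a Hub pull-request ref.

    Hypothetical helper for illustration; assumes the
    {endpoint}/{repo_id}/resolve/{revision}/{filename} layout.
    """
    # The "/" characters in the ref must be percent-encoded,
    # which is why the tree link shows refs%2Fpr%2F5.
    revision = quote(f"refs/pr/{pr_number}", safe="")
    return f"{HF_ENDPOINT}/{repo_id}/resolve/{revision}/{quote(filename)}"


if __name__ == "__main__":
    # "example.gguf" is a placeholder; check the PR's file list for real names.
    print(pr_file_url("Orenguteng/Llama-3-8B-Lexi-Uncensored-GGUF", 5, "example.gguf"))
```

If you have `huggingface_hub` installed, passing `revision="refs/pr/5"` to `hf_hub_download` should get you the same file without building the URL by hand.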
