What about the GGUF quants?
#1 by BernardH - opened
I was surprised to see GGUF quants from 7 months ago, considering that llama.cpp support for T5 only just landed. Are these supposed to work with llama.cpp?
Are there any evaluations of the performance loss incurred by quantization?
Thanks for the models!
Best regards