k-quants?

by speedorama - opened Jul 18, 2023

Discussion

speedorama

Jul 18, 2023

After llama.cpp PR #2148 (https://github.com/ggerganov/llama.cpp/pull/2148), k-quants should be possible on this model.

TheBloke

Owner Jul 18, 2023

Yes they are. I'm making k-quants with 32001-vocab models now, like WizardLM 13B v1.1. I've just not gone back to add them to already uploaded models.

I'll try and do it soon.

speedorama

Jul 21, 2023

I've tried making k-quants of my own using the f16 version of this model you provided, but I find that trying to quantize it to 5_K_S causes it to quickly become incoherent.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment