8 bit version #3
opened by bullerwins
Hi!
Since GPTQ also supports 8-bit quantization, would it be possible to upload an 8-bit version?
It would fit perfectly on 4x24GB VRAM systems, with enough VRAM left for high context (~70GB for the weights + ~26GB left for context).
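The back-of-the-envelope math behind that estimate can be sketched as follows; this assumes a ~70B-parameter model (an assumption on my part) and ignores per-GPU overhead and quantization metadata:

```python
# Rough VRAM estimate for an 8-bit quantized model.
# The ~70B parameter count is an assumption for illustration only.

def quantized_weight_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes here)."""
    bytes_per_weight = bits_per_weight / 8
    return n_params_billion * 1e9 * bytes_per_weight / 1e9

weights_gb = quantized_weight_gb(70, 8)   # ~70 GB at 8 bits per weight
total_vram_gb = 4 * 24                    # 4x 24GB GPUs = 96 GB total
context_budget_gb = total_vram_gb - weights_gb

print(f"weights: ~{weights_gb:.0f} GB, left for context/KV cache: ~{context_budget_gb:.0f} GB")
```

At 8 bits each parameter takes one byte, so the weight footprint in GB roughly equals the parameter count in billions, leaving the remaining ~26GB of the 96GB pool for the KV cache.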
Thanks