Anyway to quantize this further down to 4090 level (24Gb VRAM), at Q2_K.gguf level already not sure if it is possible

#1
by askyforever - opened
askyforever changed discussion status to closed
Arcee AI org

It's possible but you're not going to have a good time

Sign up or log in to comment