GPTQ plz
GPTQ plz.
It is a big model, I can see why that'd be a good idea.
+1
Hi all,
If @puffy310 hasn't started, I can give it a shot (assuming DeepseekV2ForCausalLM is supported by now in AutoGPTQ).
Try the vLLM version first, as the model devs have said the Hugging Face implementation isn't up to their standards anyway. "Everyone wants a quantized model but nobody wants to quantize a model." - Julian Herrera
I'll see if I can give it a try, but I doubt I have the know-how. DeepseekV2 was just released, and I don't know if AutoGPTQ works well with MoE architectures. If I have some time today I might as well try, but your implementation will most likely be better. I always love to learn, though. I'll post progress in this discussion.
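For anyone else who wants to attempt it, this is roughly the flow I'd try with AutoGPTQ. Untested sketch: the output directory and the one-line calibration text are placeholders, and it assumes DeepseekV2ForCausalLM is actually wired up in AutoGPTQ by now.

```python
# Rough AutoGPTQ quantization sketch -- untested for DeepseekV2.
# Plain settings dict so it's easy to tweak; 4-bit / group_size 128 is the common default.
QUANT_SETTINGS = {"bits": 4, "group_size": 128, "desc_act": False}

def quantize_deepseek_v2(model_id="deepseek-ai/DeepSeek-V2", out_dir="DeepSeek-V2-GPTQ"):
    # Imports kept inside the function so the sketch parses without the libs installed.
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    quantize_config = BaseQuantizeConfig(**QUANT_SETTINGS)

    # trust_remote_code because the repo ships custom modeling code.
    model = AutoGPTQForCausalLM.from_pretrained(
        model_id, quantize_config, trust_remote_code=True
    )

    # GPTQ needs calibration examples; a real run should use a few hundred
    # samples from a corpus like C4, not this toy sentence.
    examples = [
        tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")
    ]
    model.quantize(examples)

    model.save_quantized(out_dir)
    tokenizer.save_pretrained(out_dir)
```

Whether the MoE layers survive this without per-expert handling is exactly the open question.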
+1
+1
Just for a reference: https://github.com/AutoGPTQ/AutoGPTQ/issues/664
Seems not feasible in AutoAWQ as well: https://github.com/casper-hansen/AutoAWQ/issues/473
I tried building the model with AWQ. It takes a long time to rebuild the model.
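For anyone curious, the usual AutoAWQ flow looks roughly like this. Sketch only: the paths are placeholders, and per the issue linked above it may still fail on the MoE layers.

```python
# Rough AutoAWQ quantization sketch -- may still hit the MoE issues linked above.
AWQ_CONFIG = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

def awq_quantize(model_path="deepseek-ai/DeepSeek-V2", quant_path="DeepSeek-V2-AWQ"):
    # Imports inside the function so the sketch parses without awq installed.
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path, trust_remote_code=True)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    # AutoAWQ runs its own calibration pass internally during quantize().
    model.quantize(tokenizer, quant_config=AWQ_CONFIG)

    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
```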
@MaziyarPanahi AutoAWQ and GPTQModel support this model.
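If so, the GPTQModel flow should be something like this. Untested sketch: the output directory and calibration texts are placeholders, and a real run needs a proper calibration set.

```python
# Rough GPTQModel quantization sketch -- untested for DeepSeek-V2.
GPTQ_BITS = 4
GPTQ_GROUP_SIZE = 128

def gptqmodel_quantize(model_id="deepseek-ai/DeepSeek-V2", out_dir="DeepSeek-V2-gptq-4bit"):
    # Imports inside the function so the sketch parses without gptqmodel installed.
    from gptqmodel import GPTQModel, QuantizeConfig

    quant_config = QuantizeConfig(bits=GPTQ_BITS, group_size=GPTQ_GROUP_SIZE)
    model = GPTQModel.load(model_id, quant_config)

    # Placeholder calibration set; swap in a few hundred representative texts.
    calibration = ["The quick brown fox jumps over the lazy dog."] * 256
    model.quantize(calibration)

    model.save(out_dir)
```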