FIX autogptq compat

by Qubitium - opened Jun 6

base: refs/heads/main ← from: refs/pr/4

Files changed: +10 -10

FIX autogptq compat (5a0fc972)

Qubitium

Jun 6

We have a pending autogptq PR that will allow GPTQ quantization of GLM. For the autogptq PR to work, we need this simple method def/typing fix to resolve compat issues with transformers and autogptq.
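The description does not say which signatures changed, so the snippet below is only an illustrative sketch of the kind of mismatch a method def/typing fix can resolve: a caller passes keyword arguments that the model's method has to accept. `OldBlock`, `FixedBlock`, `accepts_call`, and the parameter names are all hypothetical and are not taken from modeling_chatglm.py, transformers, or autogptq.

```python
import inspect
from typing import Optional


class OldBlock:
    # Hypothetical "before" signature: no default for attention_mask and
    # no catch-all for extra keyword arguments.
    def forward(self, hidden_states, attention_mask):
        return hidden_states


class FixedBlock:
    # Hypothetical "after" signature: the optional argument has a default
    # and unknown keyword arguments are tolerated.
    def forward(self, hidden_states, attention_mask: Optional[object] = None, **kwargs):
        return hidden_states


def accepts_call(method, *args, **kwargs) -> bool:
    """Return True if `method` could be called with the given arguments."""
    try:
        # bind() raises TypeError for missing or unexpected arguments,
        # without actually calling the method.
        inspect.signature(method).bind(*args, **kwargs)
        return True
    except TypeError:
        return False


# A hypothetical caller that forwards an extra keyword argument.
caller_kwargs = {"attention_mask": None, "position_ids": None}

old_ok = accepts_call(OldBlock().forward, "h", **caller_kwargs)
fixed_ok = accepts_call(FixedBlock().forward, "h", **caller_kwargs)
print(old_ok, fixed_ok)
```

A check like this can be run against the real method before and after the change to confirm that the arguments a given caller passes are accepted.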

GPTQ quants ready for testing:

https://huggingface.co/LnL-AI/glm-4-9b-gptq-4bit-qubitium-r1
https://huggingface.co/LnL-AI/glm-4-9b-chat-gptq-4bit-qubitium-r1


Cannot merge

This branch has merge conflicts in the following files:

modeling_chatglm.py
