Generating nonsense output and then broke
#23
by
joycejiang
- opened
The model is giving me a bunch of nonsense output "íliaa enemies enemiesclicclic Sue SueUob pil Silvererde wc Sept", and if I ran it again it broke
'RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.'
model = AutoGPTQForCausalLM.from_quantized(model_name,
model_basename="gptq_model-4bit--1g",
device_map="auto",
use_safetensors=True,
use_triton=False,
trust_remote_code=True,
quantize_config=None,
inject_fused_attention=False)
There was a bug in AutoGPTQ 0.3.0 that caused gibberish in some cases. Please update to AutoGPTQ 0.3.2 which was released yesterday and fixes this issue:
pip3 uninstall -y auto-gptq
pip3 install auto-gptq==0.3.2