glm-4-9b-chat-GPTQ-Int4 / generation_config.json
tclf90
'decrease gptq group size'
ea92b81
raw
history blame
155 Bytes
{
"_from_model_config": true,
"eos_token_id": [
151329,
151336,
151338
],
"pad_token_id": 151329,
"transformers_version": "4.40.2"
}