how many resources were used for quantizing this model?
1
#4 opened 10 days ago
by
fengyang1995
Unable to use fp8 kv cache with neuralmagic quants on ampere
#3 opened 13 days ago
by
ndurkee
Storage format differs from other w4a16 models
2
#2 opened 18 days ago
by
timdettmers
weights does not exist when trying to deploy in sagemaker endpoint
1
#1 opened about 1 month ago
by
LorenzoCevolaniAXA