lavawolfiee commited on
Commit
380e4d8
1 Parent(s): 35d82eb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ Attention quantization: HQQ 4-bit, groupsize 64, compress zero, compress scale with groupsize 256 \
2
+ Experts quantization: HQQ 3-bit, groupsize 64, compress zero, compress scale with groupsize 128