Intel/bert-base-uncased-finetuned-swag-int8-static-inc

Multiple Choice

Intel® Neural Compressor

PostTrainingStatic

Inference Endpoints

xinhe committed on May 7, 2022

Commit 87b1d6e • 1 Parent(s): ccf3347

Update README.md

Files changed (1)

README.md +2 -0

@@ -34,6 +34,8 @@ The original fp32 model comes from the fine-tuned model [thyagosme/bert-base-unc
 The calibration dataloader is the train dataloader. The default calibration sampling size 100 isn't divisible exactly by batch size 8, so the real sampling size is 104.
+The linear modules **bert.encoder.layer.2.output.dense, bert.encoder.layer.5.intermediate.dense, bert.encoder.layer.9.output.dense, bert.encoder.layer.10.output.dense** fall back to fp32 to meet the 1% relative accuracy loss.
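
The two numeric statements above can be checked with a minimal sketch. It assumes that calibration consumes whole batches until the requested sample count is reached, and that "relative accuracy loss" means `(fp32 - int8) / fp32`. Both helper names are hypothetical and are not part of Intel® Neural Compressor's API, and the accuracy values in the usage lines are placeholders, not results from this model.

```python
import math


def effective_sampling_size(requested: int, batch_size: int) -> int:
    """Round the requested calibration sample count up to whole batches."""
    # Assumption: the calibration loop never stops partway through a batch.
    return math.ceil(requested / batch_size) * batch_size


def within_relative_loss(fp32_acc: float, int8_acc: float, tolerance: float = 0.01) -> bool:
    """Return True if the relative accuracy drop stays within the tolerance."""
    # Assumption: relative loss is measured against the fp32 baseline.
    return (fp32_acc - int8_acc) / fp32_acc <= tolerance


# 100 requested samples at batch size 8 need 13 batches, i.e. 104 samples.
print(effective_sampling_size(100, 8))

# Placeholder accuracies, for illustration only.
print(within_relative_loss(0.80, 0.796))
print(within_relative_loss(0.80, 0.78))
```

Under this reading, a quantized model that fails the tolerance check is the case where some modules would be kept in fp32, as the listed linear layers are here.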
 ### Test result
 - Batch size = 8