File size: 1,613 Bytes
6898be3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
---
pipeline_tag: text-classification
datasets:
- glue
metrics:
- accuracy
tags:
- roberta
---

**task**: `text-classification`

Fixed parameters:
* **model_name_or_path**: `Bhumika/roberta-base-finetuned-sst2`
* **dataset**:
    * **path**: `glue`
    * **eval_split**: `validation`
    * **data_keys**: `{'primary': 'sentence'}`
    * **ref_keys**: `['label']`
    * **name**: `sst2`
* **quantization_approach**: `dynamic`
* **node_exclusion**: `[]`
* **per_channel**: `False`
* **framework**: `onnxruntime`
* **framework_args**:
    * **opset**: `15`
    * **optimization_level**: `1`
* **aware_training**: `False`

Benchmarked parameters:
* **operators_to_quantize**: `['Add', 'MatMul']`,  `['Add']`

## Evaluation
Below, time metrics for
* Batch size: 8
* Input length: 128
| operators_to_quantize |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |     | accuracy (original) | accuracy (optimized) |
| :-------------------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: | :-: | :-----------------: | :------------------: |
|  `['Add', 'MatMul']`  |  \|  |           619.76            |            161.66            |  \|  |           1.80            |            6.20            |  \|  |        1.000        |        1.000         |
|       `['Add']`       |  \|  |           611.74            |            478.48            |  \|  |           1.80            |            2.20            |  \|  |        1.000        |        1.000         |