andeskyl commited on
Commit
6534357
1 Parent(s): 5fe2f7b

Update model card

Browse files
Files changed (1) hide show
  1. README.md +57 -3
README.md CHANGED
@@ -1,3 +1,57 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ base_model: bert-base-cased
6
+ tags:
7
+ - generated_from_trainer
8
+ datasets:
9
+ - glue
10
+ metrics:
11
+ - accuracy
12
+ model-index:
13
+ - name: sst2
14
+ results:
15
+ - task:
16
+ name: Text Classification
17
+ type: text-classification
18
+ dataset:
19
+ name: GLUE SST2
20
+ type: glue
21
+ args: sst2
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.926605504587156
26
+ ---
27
+
28
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
+ should probably proofread and complete it, then remove this comment. -->
30
+
31
+ # bert-base-cased-sst2
32
+ This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the GLUE SST2 dataset.
33
+ It achieves the following results on the evaluation set:
34
+ - Loss: 0.2890
35
+ - Accuracy: 0.9266
36
+
37
+ ## Model description
38
+ Please refer to [this repository](https://huggingface.co/google-bert/bert-base-cased).
39
+
40
+ ## Intended uses
41
+ This model is for the artifact evaluation of the paper "SHAFT: Secure, Handy, Accurate, and Fast Transformer Inference."
42
+
43
+ ### Training hyperparameters
44
+ The following hyperparameters were used during training:
45
+ - learning_rate: 2e-05
46
+ - train_batch_size: 64
47
+ - eval_batch_size: 16
48
+ - seed: 42
49
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
+ - lr_scheduler_type: linear
51
+ - num_epochs: 3.0
52
+
53
+ ### Framework versions
54
+ - Transformers 4.42.0.dev0
55
+ - Pytorch 2.0.1+cu118
56
+ - Datasets 2.20.0
57
+ - Tokenizers 0.19.1