Kerz committed on
Commit
c0ddeeb
1 Parent(s): 7416198

update model card README.md

Files changed (1)
  1. README.md +8 -6
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Accuracy
  type: accuracy
- value: 0.22
+ value: 0.499
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the yelp_review_full dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.6089
- - Accuracy: 0.22
+ - Loss: 1.1692
+ - Accuracy: 0.499

  ## Model description

@@ -56,6 +56,8 @@ The following hyperparameters were used during training:
  - train_batch_size: 1
  - eval_batch_size: 8
  - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 4
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 3.0
@@ -64,9 +66,9 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 1.6639 | 1.0 | 1000 | 1.6370 | 0.166 |
- | 1.6549 | 2.0 | 2000 | 1.6164 | 0.22 |
- | 1.6264 | 3.0 | 3000 | 1.6089 | 0.22 |
+ | No log | 1.0 | 250 | 1.4265 | 0.391 |
+ | 1.4806 | 2.0 | 500 | 1.2233 | 0.458 |
+ | 1.4806 | 3.0 | 750 | 1.1692 | 0.499 |


  ### Framework versions
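
Note on the step counts in this diff: the new `gradient_accumulation_steps: 4` setting is consistent with the change from 1000 to 250 steps per epoch and with `total_train_batch_size: 4`. The sketch below reproduces that arithmetic. It rests on assumptions not stated in the commit: a single device, a training set of 1000 examples (inferred from the old table, 1000 steps per epoch at batch size 1), and a logging interval of 500 steps, which would explain the `No log` entry at step 250 and the repeated `1.4806` at steps 500 and 750. The helper names are hypothetical and are not part of any library.

```python
def optimizer_steps_per_epoch(num_examples, per_device_batch_size, grad_accum_steps):
    """Optimizer updates per epoch on one device.

    Simplified: assumes the example count divides evenly by the
    effective batch size, as it does for the numbers in this diff.
    """
    batches = num_examples // per_device_batch_size
    return batches // grad_accum_steps


def last_logged_step(eval_step, logging_steps):
    """Most recent logging step at or before eval_step, or None if nothing was logged yet."""
    n = eval_step // logging_steps
    return n * logging_steps if n > 0 else None


# Assumption: 1000 training examples, inferred from the old results table.
NUM_EXAMPLES = 1000

old_steps = optimizer_steps_per_epoch(NUM_EXAMPLES, 1, 1)  # before this commit
new_steps = optimizer_steps_per_epoch(NUM_EXAMPLES, 1, 4)  # after this commit
total_train_batch_size = 1 * 4  # train_batch_size * gradient_accumulation_steps

print(old_steps, new_steps, total_train_batch_size)  # 1000 250 4

# Assumption: training loss is logged every 500 steps.
print([last_logged_step(s, 500) for s in (250, 500, 750)])  # [None, 500, 500]
```

Under these assumptions, the evaluation at step 250 has no logged training loss to report, and the evaluations at steps 500 and 750 both show the value logged at step 500.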