velaa commited on
Commit
fb134ed
1 Parent(s): 9417477

End of training

Browse files
README.md CHANGED
@@ -5,9 +5,24 @@ tags:
5
  - generated_from_trainer
6
  datasets:
7
  - glue
 
 
8
  model-index:
9
  - name: opt-125m-finetuned-mnli
10
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,13 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the glue dataset.
19
  It achieves the following results on the evaluation set:
20
- - eval_loss: 0.6714
21
- - eval_accuracy: 0.7178
22
- - eval_runtime: 42.0601
23
- - eval_samples_per_second: 233.356
24
- - eval_steps_per_second: 14.598
25
- - epoch: 0.03
26
- - step: 669
27
 
28
  ## Model description
29
 
@@ -50,6 +60,17 @@ The following hyperparameters were used during training:
50
  - lr_scheduler_type: linear
51
  - num_epochs: 5
52
 
 
 
 
 
 
 
 
 
 
 
 
53
  ### Framework versions
54
 
55
  - Transformers 4.35.2
 
5
  - generated_from_trainer
6
  datasets:
7
  - glue
8
+ metrics:
9
+ - accuracy
10
  model-index:
11
  - name: opt-125m-finetuned-mnli
12
+ results:
13
+ - task:
14
+ name: Text Classification
15
+ type: text-classification
16
+ dataset:
17
+ name: glue
18
+ type: glue
19
+ config: mnli
20
+ split: validation_matched
21
+ args: mnli
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.3467142129393785
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the glue dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 1.2410
36
+ - Accuracy: 0.3467
 
 
 
 
 
37
 
38
  ## Model description
39
 
 
60
  - lr_scheduler_type: linear
61
  - num_epochs: 5
62
 
63
+ ### Training results
64
+
65
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
66
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
67
+ | No log | 1.0 | 1 | 1.2436 | 0.3416 |
68
+ | No log | 2.0 | 2 | 1.2381 | 0.3465 |
69
+ | No log | 3.0 | 3 | 1.2410 | 0.3467 |
70
+ | No log | 4.0 | 4 | 1.2437 | 0.3458 |
71
+ | No log | 5.0 | 5 | 1.2450 | 0.3467 |
72
+
73
+
74
  ### Framework versions
75
 
76
  - Transformers 4.35.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d3c469e3bac8259988c8973f6cc6f5e42e78ae890096872c9a30a3a6f1ef0878
3
  size 500988904
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1cc0640e47c1d2be31a5bec3458a7cbd34a8b0d9a6e77b7986596aacc8e5d643
3
  size 500988904
runs/Dec01_04-56-25_71d6bd7cea63/events.out.tfevents.1701406983.71d6bd7cea63.1693.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c050a5dbf8ba6faccb4861a0449e883692f28df60cf155967f5638841c7c3f04
3
+ size 405