Ayus077BCT014Bhandari committed on
Commit
f9f105d
1 Parent(s): c765571

End of training

README.md ADDED
@@ -0,0 +1,80 @@
+ ---
+ license: apache-2.0
+ base_model: google/mt5-small
+ tags:
+ - generated_from_trainer
+ metrics:
+ - accuracy
+ - precision
+ - recall
+ - f1
+ - wer
+ model-index:
+ - name: mt5_epoch_1To6
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # mt5_epoch_1To6
+
+ This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: nan
+ - Accuracy: 0.0
+ - Precision: 0.0
+ - Recall: 0.0
+ - F1: 0.0
+ - Wer: 1.0
+ - Cer: 1.0
+ - Chrf: 0.0
+ - Exact Match: 0.0
+ - Bertscore:precision: 0.0
+ - Bertscore:recall: 0.0
+ - Bertscore:f1: 0.0
+ - Ter: 100.0
+ - Blerurt: -1.3889
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 4
+ - eval_batch_size: 4
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 500
+ - num_epochs: 5
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 | Wer | Cer | Chrf | Exact Match | Bertscore:precision | Bertscore:recall | Bertscore:f1 | Ter | Blerurt |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:---:|:---:|:---:|:----:|:-----------:|:-------------------:|:----------------:|:------------:|:-----:|:-------:|
+ | 0.0 | 1.57 | 1000 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | -1.3889 |
+ | 0.0 | 3.14 | 2000 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | -1.3889 |
+ | 0.0 | 4.72 | 3000 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | -1.3889 |
+
+
+ ### Framework versions
+
+ - Transformers 4.37.2
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.17.0
+ - Tokenizers 0.15.2
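
The README's hyperparameters list a `linear` scheduler with 500 warmup steps and a peak learning rate of 5e-05. A minimal plain-Python sketch of that schedule shape follows. It is not the Trainer's own code: the function name `linear_schedule_lr` is hypothetical, and `total_steps=3185` is an assumption estimated from the results table (step 1000 at epoch 1.57 implies roughly 637 steps per epoch, times 5 epochs).

```python
def linear_schedule_lr(step, base_lr=5e-05, warmup_steps=500, total_steps=3185):
    """Learning rate at `step` for linear warmup followed by linear decay.

    base_lr and warmup_steps come from the README; total_steps is an
    estimate (about 637 steps per epoch * 5 epochs), not a logged value.
    """
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay: ramp linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Steps at which the README's results table reports evaluations,
    # plus the start and the end of warmup.
    for step in (0, 500, 1000, 2000, 3000):
        print(step, linear_schedule_lr(step))
```

Under these assumptions the rate peaks at step 500 and is already decaying at each of the three logged evaluation steps.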
generation_config.json ADDED
@@ -0,0 +1,7 @@
+ {
+   "_from_model_config": true,
+   "decoder_start_token_id": 0,
+   "eos_token_id": 1,
+   "pad_token_id": 0,
+   "transformers_version": "4.37.2"
+ }
logs_process_track/events.out.tfevents.1708174461.18162472cfa7.514.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:24db86e145ad84265eeacb162716e95c52da98c092947cba8cb16acdcfdb5355
- size 6827
+ oid sha256:ba350b82992a9b8efb91826956bacea8735200427267baed27c5ab6d6f6e51d6
+ size 8284
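
The TensorBoard event file is stored through Git LFS, so this diff shows only its pointer: a spec `version`, an `oid` (the SHA-256 of the real file content), and the `size` in bytes. As a minimal sketch, the two pointers in this commit can be parsed and compared as below. `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling.

```python
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:24db86e145ad84265eeacb162716e95c52da98c092947cba8cb16acdcfdb5355
size 6827
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ba350b82992a9b8efb91826956bacea8735200427267baed27c5ab6d6f6e51d6
size 8284
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


if __name__ == "__main__":
    old = parse_lfs_pointer(OLD_POINTER)
    new = parse_lfs_pointer(NEW_POINTER)
    # Size difference between the two versions of the log file, in bytes.
    print(new["size"] - old["size"])
```

The size difference only says the log file grew in this commit; the pointer itself carries no information about what was logged.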