End of training
README.md
CHANGED
@@ -1,8 +1,10 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
-
|
|
|
6 |
model-index:
|
7 |
- name: Vit-GPT2-COCO2017Flickr-40k-05
|
8 |
results: []
|
@@ -13,19 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
13 |
|
14 |
# Vit-GPT2-COCO2017Flickr-40k-05
|
15 |
|
16 |
-
This model is a fine-tuned version of [
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
-
|
19 |
-
-
|
20 |
-
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
-
|
24 |
-
- eval_runtime: 302.1813
|
25 |
-
- eval_samples_per_second: 9.928
|
26 |
-
- eval_steps_per_second: 2.482
|
27 |
-
- epoch: 0.35
|
28 |
-
- step: 3500
|
29 |
|
30 |
## Model description
|
31 |
|
@@ -52,6 +49,42 @@ The following hyperparameters were used during training:
|
|
52 |
- lr_scheduler_type: linear
|
53 |
- num_epochs: 3.0
|
54 |
|
55 |
### Framework versions
|
56 |
|
57 |
- Transformers 4.39.3
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
base_model: NourFakih/Vit-GPT2-COCO2017Flickr-40k-05
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
metrics:
|
7 |
+
- rouge
|
8 |
model-index:
|
9 |
- name: Vit-GPT2-COCO2017Flickr-40k-05
|
10 |
results: []
|
|
|
15 |
|
16 |
# Vit-GPT2-COCO2017Flickr-40k-05
|
17 |
|
18 |
+
This model is a fine-tuned version of [NourFakih/Vit-GPT2-COCO2017Flickr-40k-05](https://huggingface.co/NourFakih/Vit-GPT2-COCO2017Flickr-40k-05) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Loss: 0.5528
|
21 |
+
- Rouge1: 44.1624
|
22 |
+
- Rouge2: 19.6736
|
23 |
+
- Rougel: 40.3898
|
24 |
+
- Rougelsum: 40.4029
|
25 |
+
- Gen Len: 12.263
|
26 |
|
27 |
## Model description
|
28 |
|
|
|
49 |
- lr_scheduler_type: linear
|
50 |
- num_epochs: 3.0
|
51 |
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 0.1497 | 0.1 | 500 | 0.5462 | 40.1774 | 14.6199 | 36.3335 | 36.3518 | 12.5965 |
+| 0.1604 | 0.2 | 1000 | 0.5302 | 41.4714 | 16.0237 | 37.5992 | 37.5915 | 11.914 |
+| 0.1631 | 0.3 | 1500 | 0.5436 | 40.3816 | 14.6958 | 36.6109 | 36.6027 | 12.3295 |
+| 0.1634 | 0.4 | 2000 | 0.5266 | 40.9484 | 15.9068 | 37.5194 | 37.5088 | 12.033 |
+| 0.1576 | 0.5 | 2500 | 0.5544 | 40.373 | 15.012 | 36.5218 | 36.5141 | 12.3345 |
+| 0.1599 | 0.6 | 3000 | 0.5425 | 40.7552 | 15.2754 | 37.1059 | 37.1299 | 12.191 |
+| 0.291 | 0.7 | 3500 | 0.4545 | 41.5934 | 16.251 | 37.7291 | 37.7113 | 12.0295 |
+| 0.2825 | 0.8 | 4000 | 0.4558 | 42.6728 | 17.1703 | 38.8692 | 38.8841 | 12.246 |
+| 0.2737 | 0.9 | 4500 | 0.4565 | 43.0036 | 16.8421 | 39.1761 | 39.1693 | 11.7975 |
+| 0.2683 | 1.0 | 5000 | 0.4576 | 42.1341 | 16.7973 | 38.2881 | 38.3083 | 11.8655 |
+| 0.1687 | 1.1 | 5500 | 0.4996 | 41.7152 | 16.4042 | 37.7724 | 37.7629 | 12.384 |
+| 0.168 | 1.2 | 6000 | 0.5046 | 41.6521 | 16.6159 | 37.7915 | 37.7778 | 12.661 |
+| 0.1688 | 1.3 | 6500 | 0.5020 | 42.3292 | 17.1408 | 38.5407 | 38.5282 | 11.846 |
+| 0.1682 | 1.4 | 7000 | 0.5045 | 42.848 | 17.6905 | 38.9854 | 38.9896 | 12.025 |
+| 0.1703 | 1.5 | 7500 | 0.5103 | 42.1175 | 16.7765 | 38.3023 | 38.3199 | 12.4315 |
+| 0.1618 | 1.6 | 8000 | 0.5019 | 43.207 | 17.8145 | 39.3822 | 39.3884 | 12.3485 |
+| 0.1657 | 1.7 | 8500 | 0.4945 | 42.8399 | 17.8975 | 39.1618 | 39.1951 | 11.8575 |
+| 0.1643 | 1.8 | 9000 | 0.5064 | 43.0186 | 17.8969 | 39.2518 | 39.2735 | 12.0095 |
+| 0.1654 | 1.9 | 9500 | 0.5011 | 43.2785 | 18.2603 | 39.4479 | 39.4437 | 12.2305 |
+| 0.158 | 2.0 | 10000 | 0.4945 | 43.3824 | 18.3183 | 39.3471 | 39.3334 | 12.1495 |
+| 0.1096 | 2.1 | 10500 | 0.5520 | 43.5068 | 18.4313 | 39.7084 | 39.7205 | 12.112 |
+| 0.1037 | 2.2 | 11000 | 0.5510 | 43.1909 | 18.1204 | 39.1945 | 39.2052 | 12.349 |
+| 0.1045 | 2.3 | 11500 | 0.5453 | 42.9965 | 18.4064 | 39.0931 | 39.0868 | 12.1825 |
+| 0.1027 | 2.4 | 12000 | 0.5473 | 43.4973 | 18.8697 | 39.944 | 39.9407 | 12.447 |
+| 0.1034 | 2.5 | 12500 | 0.5512 | 43.9534 | 19.327 | 40.0946 | 40.0724 | 12.2395 |
+| 0.1018 | 2.6 | 13000 | 0.5527 | 43.7136 | 19.1214 | 39.9218 | 39.9274 | 12.3245 |
+| 0.0986 | 2.7 | 13500 | 0.5557 | 44.0502 | 19.3213 | 40.0291 | 40.0286 | 12.3345 |
+| 0.0953 | 2.8 | 14000 | 0.5510 | 44.0001 | 19.4482 | 40.1204 | 40.1175 | 12.1255 |
+| 0.098 | 2.9 | 14500 | 0.5534 | 43.9554 | 19.4673 | 40.1401 | 40.1521 | 12.2395 |
+| 0.0947 | 3.0 | 15000 | 0.5528 | 44.1624 | 19.6736 | 40.3898 | 40.4029 | 12.263 |
+
+
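Review note: the rows added under "Training results" can be compared programmatically to see which logged step scores best on each metric. The following is a minimal sketch, not part of the commit: the `ROWS` list and the `best_by` helper are hypothetical names, and only the Step, Validation Loss, and Rouge1 columns are copied from the table.

```python
# (step, validation_loss, rouge1) copied from the "Training results" rows.
# ROWS and best_by are illustrative names, not part of the training script.
ROWS = [
    (500, 0.5462, 40.1774), (1000, 0.5302, 41.4714), (1500, 0.5436, 40.3816),
    (2000, 0.5266, 40.9484), (2500, 0.5544, 40.373), (3000, 0.5425, 40.7552),
    (3500, 0.4545, 41.5934), (4000, 0.4558, 42.6728), (4500, 0.4565, 43.0036),
    (5000, 0.4576, 42.1341), (5500, 0.4996, 41.7152), (6000, 0.5046, 41.6521),
    (6500, 0.5020, 42.3292), (7000, 0.5045, 42.848), (7500, 0.5103, 42.1175),
    (8000, 0.5019, 43.207), (8500, 0.4945, 42.8399), (9000, 0.5064, 43.0186),
    (9500, 0.5011, 43.2785), (10000, 0.4945, 43.3824), (10500, 0.5520, 43.5068),
    (11000, 0.5510, 43.1909), (11500, 0.5453, 42.9965), (12000, 0.5473, 43.4973),
    (12500, 0.5512, 43.9534), (13000, 0.5527, 43.7136), (13500, 0.5557, 44.0502),
    (14000, 0.5510, 44.0001), (14500, 0.5534, 43.9554), (15000, 0.5528, 44.1624),
]


def best_by(rows, column, lowest=True):
    """Return the row with the lowest (or highest) value in the given column."""
    pick = min if lowest else max
    return pick(rows, key=lambda row: row[column])


lowest_loss = best_by(ROWS, column=1, lowest=True)
highest_rouge1 = best_by(ROWS, column=2, lowest=False)

print("lowest validation loss:", lowest_loss)
print("highest Rouge1:", highest_rouge1)
```

In this table the two criteria pick different steps: validation loss is lowest at step 3500, while Rouge1 peaks at the final step 15000, which is the row the model card summary reports.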
|
88 |
### Framework versions
|
89 |
|
90 |
- Transformers 4.39.3
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 956835520
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a6dad99f38a8621a218d8d8fa95c853ef60b7ba5ea7c3da9e0b800d757b15e3b
|
3 |
size 956835520
|
runs/May28_10-49-49_453cbf5e9962/events.out.tfevents.1716893390.453cbf5e9962.34.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f6ca83bca4bf62090ab04ae811949b31325b98844694085bae8872faf8d03c91
|
3 |
+
size 31536
|
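Review note: the `model.safetensors` and `events.out.tfevents` entries in this commit are Git LFS pointer files, each consisting of a `version`, `oid`, and `size` line. A minimal sketch of reading such a pointer follows, assuming only the three-line layout visible in the diff; `parse_lfs_pointer` is a hypothetical helper, not a Git LFS or Hugging Face API.

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New-side pointer for model.safetensors, copied from the diff.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a6dad99f38a8621a218d8d8fa95c853ef60b7ba5ea7c3da9e0b800d757b15e3b
size 956835520
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

The weights file keeps the same size (956835520 bytes) on both sides of the diff, so a changed `oid` is the only sign that its contents were updated.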