End of training

Browse files

Files changed (5) hide show

README.md +17 -17
model.safetensors +1 -1
runs/Dec15_03-40-28_b763d5f39da5/events.out.tfevents.1702611635.b763d5f39da5.369.0 +3 -0
runs/Dec15_03-49-44_b763d5f39da5/events.out.tfevents.1702612189.b763d5f39da5.369.1 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2842
-- Rouge1: 64.1043
-- Rouge2: 39.0476
-- Rougel: 60.3628
-- Rougelsum: 60.9297
-- Gen Len: 14.5714
 ## Model description
@@ -53,21 +53,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 8    | 2.0867          | 15.5226 | 0.0     | 11.4409 | 11.4409   | 15.5714 |
-| No log        | 2.0   | 16   | 1.1098          | 21.0884 | 0.0     | 20.5782 | 20.7483   | 17.0    |
-| No log        | 3.0   | 24   | 0.6635          | 52.381  | 21.4286 | 51.9728 | 52.381    | 11.0    |
-| No log        | 4.0   | 32   | 0.5018          | 52.8571 | 29.5238 | 51.7007 | 52.381    | 11.5714 |
-| No log        | 5.0   | 40   | 0.4491          | 56.7347 | 28.5714 | 53.1293 | 53.5828   | 12.7143 |
-| No log        | 6.0   | 48   | 0.3894          | 60.4535 | 38.5714 | 57.7551 | 57.8685   | 13.8571 |
-| No log        | 7.0   | 56   | 0.3616          | 60.4535 | 38.5714 | 57.7551 | 57.8685   | 14.0    |
-| No log        | 8.0   | 64   | 0.3140          | 64.1043 | 39.0476 | 60.3628 | 60.9297   | 14.5714 |
-| No log        | 9.0   | 72   | 0.2880          | 64.1043 | 39.0476 | 60.3628 | 60.9297   | 14.5714 |
-| No log        | 10.0  | 80   | 0.2842          | 64.1043 | 39.0476 | 60.3628 | 60.9297   | 14.5714 |
 ### Framework versions
 - Transformers 4.35.2
-- Pytorch 2.1.0+cu118
 - Datasets 2.15.0
 - Tokenizers 0.15.0

 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4302
+- Rouge1: 63.0769
+- Rouge2: 60.1732
+- Rougel: 63.5165
+- Rougelsum: 63.0769
+- Gen Len: 11.8571
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 8    | 2.3782          | 13.5065 | 0.0     | 13.7662 | 13.7662   | 11.0    |
+| No log        | 2.0   | 16   | 1.5121          | 36.1472 | 21.4286 | 37.6314 | 37.6314   | 15.4286 |
+| No log        | 3.0   | 24   | 1.0687          | 42.449  | 28.5714 | 44.0816 | 44.0816   | 10.1429 |
+| No log        | 4.0   | 32   | 0.8164          | 44.898  | 34.2857 | 46.9388 | 46.9388   | 12.8571 |
+| No log        | 5.0   | 40   | 0.6774          | 53.0887 | 43.8095 | 53.9325 | 54.8234   | 12.0    |
+| No log        | 6.0   | 48   | 0.5566          | 50.7483 | 45.2381 | 51.5646 | 52.381    | 10.2857 |
+| No log        | 7.0   | 56   | 0.4936          | 63.0769 | 60.1732 | 63.5165 | 63.0769   | 10.5714 |
+| No log        | 8.0   | 64   | 0.4619          | 63.0769 | 60.1732 | 63.5165 | 63.0769   | 11.8571 |
+| No log        | 9.0   | 72   | 0.4368          | 63.0769 | 60.1732 | 63.5165 | 63.0769   | 11.8571 |
+| No log        | 10.0  | 80   | 0.4302          | 63.0769 | 60.1732 | 63.5165 | 63.0769   | 11.8571 |
 ### Framework versions
 - Transformers 4.35.2
+- Pytorch 2.1.0+cu121
 - Datasets 2.15.0
 - Tokenizers 0.15.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2453be168d327d54dde73612ab58e93721261354b9ad686a3110f6d4d4611b6a
 size 307867048

 version https://git-lfs.github.com/spec/v1
+oid sha256:bbcb3a7f859cc9babc4a0966cc519313b7422f40f66f4f85cc5be71e84281621
 size 307867048

runs/Dec15_03-40-28_b763d5f39da5/events.out.tfevents.1702611635.b763d5f39da5.369.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a1dff011eb76ddd42a0af10613d5228e52553e3678d9fc36c1347e17ca55bfb6
+size 10809

runs/Dec15_03-49-44_b763d5f39da5/events.out.tfevents.1702612189.b763d5f39da5.369.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ed13ad3d6def588b511050940c7c0fa43bd217e358f4867a6a1bf9ef3a717cba
+size 10809

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:58789ea0615c35016045bd1993b69d07628c6e6228fa7495f5d8b7bd122397c7
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:98fde942d90846f8cb62f1dd27e9ebd5d366dcdade23dce6ecd9b4558c4d273f
 size 4728