End of training

Files changed (5)

README.md +16 -16
model.safetensors +1 -1
runs/Dec13_07-36-12_a8ef923cfc5d/events.out.tfevents.1702452978.a8ef923cfc5d.857.2 +3 -0
tokenizer.json +2 -2
training_args.bin +2 -2

README.md CHANGED

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9574
-- Rouge1: 62.2222
-- Rouge2: 42.8571
-- Rougel: 62.2222
-- Rougelsum: 62.7778
-- Gen Len: 12.0
 ## Model description
@@ -53,16 +53,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 7    | 1.7987          | 61.1111 | 38.8889 | 61.1111 | 61.3889   | 10.0    |
-| No log        | 2.0   | 14   | 1.7049          | 60.0    | 37.3016 | 58.8889 | 60.0      | 10.5    |
-| No log        | 3.0   | 21   | 1.6977          | 60.0    | 37.3016 | 58.8889 | 60.0      | 10.5    |
-| No log        | 4.0   | 28   | 1.8914          | 65.5556 | 45.6349 | 65.5556 | 65.5556   | 10.8333 |
-| No log        | 5.0   | 35   | 1.9302          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
-| No log        | 6.0   | 42   | 1.9448          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
-| No log        | 7.0   | 49   | 1.9859          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
-| No log        | 8.0   | 56   | 1.9541          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
-| No log        | 9.0   | 63   | 1.9647          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
-| No log        | 10.0  | 70   | 1.9574          | 62.2222 | 42.8571 | 62.2222 | 62.7778   | 12.0    |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0064
+- Rouge1: 48.6395
+- Rouge2: 26.1905
+- Rougel: 47.7211
+- Rougelsum: 48.1633
+- Gen Len: 10.5714
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 8    | 2.5594          | 10.2165 | 0.0     | 9.9567  | 10.2165   | 11.0    |
+| No log        | 2.0   | 16   | 2.0407          | 25.7143 | 14.2857 | 25.7143 | 25.7143   | 6.0     |
+| No log        | 3.0   | 24   | 1.7822          | 46.0317 | 33.3333 | 46.0317 | 45.7143   | 11.7143 |
+| No log        | 4.0   | 32   | 1.5058          | 50.3401 | 32.8571 | 49.1156 | 49.932    | 11.0    |
+| No log        | 5.0   | 40   | 1.2425          | 48.6395 | 26.1905 | 47.7211 | 48.1633   | 11.7143 |
+| No log        | 6.0   | 48   | 1.1249          | 52.8912 | 33.3333 | 52.2109 | 52.2109   | 13.4286 |
+| No log        | 7.0   | 56   | 1.0713          | 41.7687 | 21.4286 | 40.9524 | 41.7687   | 9.7143  |
+| No log        | 8.0   | 64   | 1.0475          | 46.1224 | 21.4286 | 45.1701 | 45.4422   | 10.1429 |
+| No log        | 9.0   | 72   | 1.0154          | 43.8095 | 16.6667 | 42.7211 | 42.9932   | 10.7143 |
+| No log        | 10.0  | 80   | 1.0064          | 48.6395 | 26.1905 | 47.7211 | 48.1633   | 10.5714 |
 ### Framework versions
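
The README summary on the new side repeats the final-epoch row (Loss 1.0064, Rouge1 48.6395). A minimal sketch, assuming the rows are copied by hand from the added table, of checking which epoch is best under each criterion; `rows`, `best_by_loss`, and `best_by_rouge1` are illustrative names, not part of the training script:

```python
# Rows copied from the added (+) training-results table:
# (epoch, step, validation_loss, rouge1)
rows = [
    (1, 8, 2.5594, 10.2165),
    (2, 16, 2.0407, 25.7143),
    (3, 24, 1.7822, 46.0317),
    (4, 32, 1.5058, 50.3401),
    (5, 40, 1.2425, 48.6395),
    (6, 48, 1.1249, 52.8912),
    (7, 56, 1.0713, 41.7687),
    (8, 64, 1.0475, 46.1224),
    (9, 72, 1.0154, 43.8095),
    (10, 80, 1.0064, 48.6395),
]

# Epoch with the lowest validation loss and epoch with the highest Rouge1.
best_by_loss = min(rows, key=lambda r: r[2])
best_by_rouge1 = max(rows, key=lambda r: r[3])

# The Step column advances by a fixed amount per epoch.
steps_per_epoch = rows[0][1] // rows[0][0]

print("lowest validation loss: epoch", best_by_loss[0])
print("highest Rouge1: epoch", best_by_rouge1[0])
print("steps per epoch:", steps_per_epoch)
```

The two criteria disagree in this run: validation loss is lowest at epoch 10, while Rouge1 peaks at epoch 6.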

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ce5df2a978d825ecc3f8dd009d278630a2d0247395ad5366ee6283674dbfd74a
 size 307867048

 version https://git-lfs.github.com/spec/v1
+oid sha256:cf626a11861b9965737b34222a931cc0b246f7676fb38d6a1bdebb6218967175
 size 307867048

runs/Dec13_07-36-12_a8ef923cfc5d/events.out.tfevents.1702452978.a8ef923cfc5d.857.2 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8867df26c4b253e6fe9e3d25606fc2e8145615dbf349af62b63c4a9108ee4241
+size 10809

tokenizer.json CHANGED

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 249,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 249
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 307,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 307
     },
     "direction": "Right",
     "pad_to_multiple_of": null,
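
The change above raises both the truncation `max_length` and the fixed padding length from 249 to 307. A minimal sketch in plain Python of what right-side truncation followed by right-side fixed-length padding does to a list of token ids; the helper `truncate_and_pad` and the pad id of 0 are assumptions for illustration, and this does not reproduce the `tokenizers` library's actual implementation (special-token handling is ignored):

```python
def truncate_and_pad(ids, max_length=307, pad_id=0):
    """Cut ids on the right to max_length, then pad on the right to exactly max_length."""
    kept = ids[:max_length]                      # "direction": "Right" truncation
    n_pad = max_length - len(kept)
    attention_mask = [1] * len(kept) + [0] * n_pad
    padded = kept + [pad_id] * n_pad             # "Fixed": 307, "direction": "Right"
    return padded, attention_mask

# A 10-token input is padded up to 307; a 400-token input is cut down to 307.
short_ids, short_mask = truncate_and_pad(list(range(1, 11)))
long_ids, long_mask = truncate_and_pad(list(range(1, 401)))

print(len(short_ids), sum(short_mask))
print(len(long_ids), sum(long_mask))
```

With fixed padding, every encoded sequence has the same length regardless of input size, so any input longer than 307 tokens loses its tail.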

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3f85d224bddbf0e4f736c7df1a936c5420b088751daeee4c7e5ae3e83e7a6839
-size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:9d201fcae9071d8f202b1f02da6b5990fbe3200d35b22750145f519e5317d30e
+size 4792