pradeepmishra1107
/

model-pradeep-flan-t5-small

@@ -16,8 +16,6 @@ should probably proofread and complete it, then remove this comment. -->
 # model-pradeep-flan-t5-small
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the squad dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.1329
 ## Model description
@@ -42,37 +40,13 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 25
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 250  | 2.2352          |
-| 2.9785        | 2.0   | 500  | 1.7603          |
-| 2.9785        | 3.0   | 750  | 1.4722          |
-| 2.006         | 4.0   | 1000 | 1.3617          |
-| 2.006         | 5.0   | 1250 | 1.2789          |
-| 1.6474        | 6.0   | 1500 | 1.2341          |
-| 1.6474        | 7.0   | 1750 | 1.2316          |
-| 1.4551        | 8.0   | 2000 | 1.2106          |
-| 1.4551        | 9.0   | 2250 | 1.1793          |
-| 1.3309        | 10.0  | 2500 | 1.1819          |
-| 1.3309        | 11.0  | 2750 | 1.1734          |
-| 1.2264        | 12.0  | 3000 | 1.1587          |
-| 1.2264        | 13.0  | 3250 | 1.1433          |
-| 1.1625        | 14.0  | 3500 | 1.1390          |
-| 1.1625        | 15.0  | 3750 | 1.1471          |
-| 1.1101        | 16.0  | 4000 | 1.1345          |
-| 1.1101        | 17.0  | 4250 | 1.1315          |
-| 1.055         | 18.0  | 4500 | 1.1458          |
-| 1.055         | 19.0  | 4750 | 1.1278          |
-| 1.032         | 20.0  | 5000 | 1.1287          |
-| 1.032         | 21.0  | 5250 | 1.1417          |
-| 0.9976        | 22.0  | 5500 | 1.1390          |
-| 0.9976        | 23.0  | 5750 | 1.1286          |
-| 1.0106        | 24.0  | 6000 | 1.1336          |
-| 1.0106        | 25.0  | 6250 | 1.1329          |
 ### Framework versions

 # model-pradeep-flan-t5-small
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the squad dataset.
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 250  | 5.1725          |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ed0072bd50cbf1027acf9afc7ada265c77568d868e5c705882ef8e9c28143da
 size 242116870

 version https://git-lfs.github.com/spec/v1
+oid sha256:554d33f89c381833cace541802224b330b109e33f371afb905fa68a7a55fb96f
 size 242116870

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 384,
+    "strategy": "OnlySecond",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 384
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "<pad>"
+  },
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d968f94dabb0ab57b25a9190a439056d3c73c57704c4412f4c3f8c36ca1b612
 size 4536

 version https://git-lfs.github.com/spec/v1
+oid sha256:067980a05fa77a60ade7de4af71e375c3ffe40b6561544483cba8474c3b5922b
 size 4536