Venkatesh4342
/

pegasus-samsum

@@ -1,5 +1,5 @@
 ---
-base_model: google/pegasus-cnn_dailymail
 tags:
 - generated_from_trainer
 datasets:
@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 0.4616
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,14 +29,14 @@ should probably proofread and complete it, then remove this comment. -->
 # pegasus-samsum
-This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the samsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3665
-- Rouge1: 0.4616
-- Rouge2: 0.2275
-- Rougel: 0.3725
-- Rougelsum: 0.3738
-- Gen Len: 35.6667
 ## Model description
@@ -55,25 +55,32 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5.750420024069848e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 16
-- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| 2.2186        | 0.87  | 100  | 1.7567          | 0.3571 | 0.1437 | 0.2779 | 0.2797    | 46.7733 |
-| 1.7368        | 1.74  | 200  | 1.4933          | 0.4347 | 0.2053 | 0.3459 | 0.3461    | 35.4533 |
-| 1.6744        | 2.61  | 300  | 1.4059          | 0.4547 | 0.2179 | 0.3629 | 0.3634    | 35.68   |
-| 1.5978        | 3.47  | 400  | 1.3665          | 0.4616 | 0.2275 | 0.3725 | 0.3738    | 35.6667 |
 ### Framework versions

 ---
+base_model: google/pegasus-large
 tags:
 - generated_from_trainer
 datasets:
     metrics:
     - name: Rouge1
       type: rouge
+      value: 0.4659
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # pegasus-samsum
+This model is a fine-tuned version of [google/pegasus-large](https://huggingface.co/google/pegasus-large) on the samsum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4091
+- Rouge1: 0.4659
+- Rouge2: 0.2345
+- Rougel: 0.3946
+- Rougelsum: 0.3951
+- Gen Len: 17.7467
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| 1.8025        | 0.27  | 500  | 1.4403          | 0.4466 | 0.2101 | 0.3832 | 0.3841    | 21.64   |
+| 1.5936        | 0.54  | 1000 | 1.3766          | 0.4786 | 0.2374 | 0.4017 | 0.4013    | 21.24   |
+| 1.5926        | 0.81  | 1500 | 1.3910          | 0.5118 | 0.2643 | 0.4282 | 0.4286    | 20.2267 |
+| 1.5067        | 1.09  | 2000 | 1.4028          | 0.4982 | 0.261  | 0.4155 | 0.4157    | 20.4267 |
+| 1.5712        | 1.36  | 2500 | 1.4236          | 0.4712 | 0.234  | 0.3964 | 0.3969    | 17.0    |
+| 1.6177        | 1.63  | 3000 | 1.4151          | 0.4768 | 0.2382 | 0.4019 | 0.4022    | 16.28   |
+| 1.6289        | 1.9   | 3500 | 1.4112          | 0.4744 | 0.2346 | 0.402  | 0.4033    | 17.0267 |
+| 1.6326        | 2.17  | 4000 | 1.4096          | 0.4682 | 0.234  | 0.3985 | 0.3994    | 17.1333 |
+| 1.5929        | 2.44  | 4500 | 1.4093          | 0.4637 | 0.2342 | 0.3939 | 0.3942    | 17.16   |
+| 1.4351        | 2.72  | 5000 | 1.4090          | 0.4684 | 0.2346 | 0.3953 | 0.3955    | 17.8133 |
+| 1.6445        | 2.99  | 5500 | 1.4091          | 0.4659 | 0.2345 | 0.3946 | 0.3951    | 17.7467 |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -4,8 +4,7 @@
   "eos_token_id": 1,
   "forced_eos_token_id": 1,
   "length_penalty": 0.8,
-  "max_length": 128,
-  "min_length": 32,
   "num_beams": 8,
   "pad_token_id": 0,
   "transformers_version": "4.33.1"

   "eos_token_id": 1,
   "forced_eos_token_id": 1,
   "length_penalty": 0.8,
+  "max_length": 256,
   "num_beams": 8,
   "pad_token_id": 0,
   "transformers_version": "4.33.1"

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a231eec02b23d1fd6b3262471a927754ba96c7bdc25d6ce79c6f7fb81505cde5
 size 2283804653

 version https://git-lfs.github.com/spec/v1
+oid sha256:73d0af06060e82a368bcafdbfc8eb5ba5ad6071968bc699e79006b1c515db840
 size 2283804653