Update README.md
README.md CHANGED
@@ -209,13 +209,6 @@ An experiment investigating transfer learning capabilities by fine-tuning models
This model is a fine-tuned version of [pszemraj/long-t5-tglobal-base-16384-book-summary](https://huggingface.co/pszemraj/long-t5-tglobal-base-16384-book-summary) on the `pszemraj/scientific_lay_summarisation-elife-norm` dataset for two epochs.

-It achieves the following results on the evaluation set:
-- Loss: 2.3994
-- Rouge1: 34.2428
-- Rouge2: 4.3644
-- Rougel: 12.5332
-- Rougelsum: 30.6965
-- Gen Len: 294.0249

## Usage
@@ -245,6 +238,18 @@ print(summary)
## Training procedure
+> Note: this model was trained at a lower LR and not until "absolute convergence", with the intention of retaining some of the properties learned from the initial fine-tuning on `booksum`
+
+### Results
+
+It achieves the following results on the evaluation set:
+- Loss: 2.3994
+- Rouge1: 34.2428
+- Rouge2: 4.3644
+- Rougel: 12.5332
+- Rougelsum: 30.6965
+- Gen Len: 294.0249
+

### Training hyperparameters

The following hyperparameters were used during training:
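For context on the `## Usage` section referenced by the second hunk (its body is elided here apart from the closing `print(summary)`), a minimal sketch of how a checkpoint like this is typically loaded via the standard `transformers` summarization pipeline. The repo id is left as a placeholder since the diff doesn't show it, and the generation settings are assumptions, not the card's exact snippet:

```python
# Minimal sketch, not the card's exact Usage snippet (which this diff elides).
# The model id is a placeholder; substitute the actual repo id from the card.
from transformers import pipeline

summarizer = pipeline(
    "summarization",
    model="<this-model-repo-id>",  # placeholder for the fine-tuned checkpoint described above
)

article = "..."  # full text of a scientific article; the LongT5 base accepts up to 16384 tokens
result = summarizer(article, max_length=512)  # eval Gen Len was ~294 tokens, so leave headroom
print(result[0]["summary_text"])
```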
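Similarly, the figures in the new `### Results` block follow the naming the Hugging Face `evaluate` ROUGE metric produces (scaled by 100). A sketch of that computation, assuming the `evaluate` and `rouge_score` packages and toy strings in place of the actual eLife evaluation set:

```python
# Sketch of how ROUGE scores like those in "### Results" are typically computed.
# Assumes the `evaluate` and `rouge_score` packages; toy strings stand in for real data.
import evaluate

rouge = evaluate.load("rouge")
scores = rouge.compute(
    predictions=["the model's generated lay summary ..."],
    references=["the human-written reference lay summary ..."],
)
# scores holds rouge1 / rouge2 / rougeL / rougeLsum F-measures in [0, 1];
# the card reports them scaled by 100 (e.g. 0.3424 -> 34.24).
print(scores)
```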