synthsumm
Collection
generalist summarizers trained on curated long docs + synthetic LLM summaries
β’
5 items
β’
Updated
Fine-tuned on a synthetic dataset of curated long-context text and GPT-3.5-turbo-1106
summaries spanning multiple domains + "random" long-context examples from pretraining datasets
Try it: gradio demo | example outputs .md (gauntlet) | code for free HF inference api
It's recommended to use this model with beam search decoding. If interested, you can also use the textsum
util repo to have most of this abstracted out for you:
pip install -U textsum
from textsum.summarize import Summarizer
model_name = "pszemraj/pegasus-x-large-book_synthsumm"
summarizer = Summarizer(model_name) # GPU auto-detected
text = "put the text you don't want to read here"
summary = summarizer.summarize_string(text)
print(summary)
This model is a fine-tuned version of pszemraj/pegasus-x-large-book-summary on the None dataset. It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
---|---|---|---|---|---|---|---|---|
1.7369 | 0.38 | 125 | 1.7140 | 43.0265 | 15.8613 | 30.5774 | 38.2507 | 77.0462 |
1.7736 | 0.77 | 250 | 1.6361 | 43.0209 | 15.2384 | 29.7678 | 37.4955 | 67.6 |
1.4251 | 1.15 | 375 | 1.5931 | 46.2138 | 17.5559 | 33.0091 | 41.0385 | 74.1077 |
1.2706 | 1.54 | 500 | 1.5635 | 44.6382 | 16.5917 | 30.7551 | 39.8466 | 71.7231 |
1.4844 | 1.92 | 625 | 1.5481 | 48.141 | 19.1137 | 33.647 | 42.1211 | 73.9846 |
Base model
google/pegasus-x-large