allenai
/

PRIMERA

Text2Text Generation

Inference Endpoints

Model card Files Files and versions Community

PRIMERA / README.md

wenxxx's picture

Update README.md

10f9b14 over 2 years ago

|

851 Bytes

	---
	license: apache-2.0
	---

	HF-version model for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization (ACL 2022).

	The original code can be found [here](https://github.com/allenai/PRIMER). You can find the script and notebook to train/evaluate the model in the original github repo.

	* Note: due to the difference between the implementations of the original Longformer and the Huggingface LED model, the results of converted models are slightly different. We run a sanity check on both fine-tuned and non fine-tuned models, and show the results below:

	\| Model \| Rouge-1 \| Rouge-2 \| Rouge-L \|
	\| --- \| ----------- \|----------- \|----------- \|
	\| PRIMERA \| 42.0 \| 13.6 \| 20.8\|
	\| PRIMERA-hf \| 41.7 \|13.6 \| 20.5\|
	\| PRIMERA(finetuned) \| 49.9 \| 21.1 \| 25.9\|
	\| PRIMERA-hf(finetuned) \| 49.9 \| 20.9 \| 25.8\|