Update README.md
README.md CHANGED
@@ -5,7 +5,7 @@ tags:
 - Document Question Answering
 - Document Visual Question Answering
 datasets:
--
+- rubentito/mp-docvqa
 language:
 - en
 ---
@@ -19,18 +19,6 @@ This model was used as a baseline in [Hierarchical multimodal transformers for M
 - Results on the MP-DocVQA dataset are reported in Table 2.
 - Training hyperparameters can be found in Table 8 of Appendix D.
 
-## Model results
-
-Extended experimentation can be found in Table 2 of [Hierarchical multimodal transformers for Multi-Page DocVQA](https://arxiv.org/pdf/2212.05935.pdf).
-You can also check the live leaderboard at the [RRC Portal](https://rrc.cvc.uab.es/?ch=17&com=evaluation&task=4).
-| Model | HF name | ANLS | APPA |
-|-----------------------------------------------------------------------------------|:--------------------------------------|:-------------:|:---------:|
-| [Bert-large](https://huggingface.co/rubentito/bert-large-mpdocvqa) | rubentito/bert-large-mpdocvqa | 0.4183 | 51.6177 |
-| [Longformer-base](https://huggingface.co/rubentito/longformer-base-mpdocvqa) | rubentito/longformer-base-mpdocvqa | 0.5287 | 71.1696 |
-| [BigBird ITC base](https://huggingface.co/rubentito/bigbird-base-itc-mpdocvqa) | rubentito/bigbird-base-itc-mpdocvqa | 0.4929 | 67.5433 |
-| [LayoutLMv3 base](https://huggingface.co/rubentito/layoutlmv3-base-mpdocvqa) | rubentito/layoutlmv3-base-mpdocvqa | 0.4538 | 51.9426 |
-| [**T5 base**](https://huggingface.co/rubentito/t5-base-mpdocvqa) | rubentito/t5-base-mpdocvqa | 0.5050 | 0.0000 |
-| Hi-VT5 | TBA | 0.6201 | 79.23 |
 
 ## How to use
 
@@ -52,6 +40,20 @@ output = self.model.generate(**encoding)
 answer = tokenizer.decode(output['sequences'], skip_special_tokens=True)
 ```
 
+
+## Model results
+
+Extended experimentation can be found in Table 2 of [Hierarchical multimodal transformers for Multi-Page DocVQA](https://arxiv.org/pdf/2212.05935.pdf).
+You can also check the live leaderboard at the [RRC Portal](https://rrc.cvc.uab.es/?ch=17&com=evaluation&task=4).
+| Model | HF name | ANLS | APPA |
+|-----------------------------------------------------------------------------------|:--------------------------------------|:-------------:|:---------:|
+| [Bert-large](https://huggingface.co/rubentito/bert-large-mpdocvqa) | rubentito/bert-large-mpdocvqa | 0.4183 | 51.6177 |
+| [Longformer-base](https://huggingface.co/rubentito/longformer-base-mpdocvqa) | rubentito/longformer-base-mpdocvqa | 0.5287 | 71.1696 |
+| [BigBird ITC base](https://huggingface.co/rubentito/bigbird-base-itc-mpdocvqa) | rubentito/bigbird-base-itc-mpdocvqa | 0.4929 | 67.5433 |
+| [LayoutLMv3 base](https://huggingface.co/rubentito/layoutlmv3-base-mpdocvqa) | rubentito/layoutlmv3-base-mpdocvqa | 0.4538 | 51.9426 |
+| [**T5 base**](https://huggingface.co/rubentito/t5-base-mpdocvqa) | rubentito/t5-base-mpdocvqa | 0.5050 | 0.0000 |
+| Hi-VT5 | TBA | 0.6201 | 79.23 |
+
 ## BibTeX entry
 
 ```tex
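
Note: the diff context shows only the tail of the README's "How to use" snippet (`output = self.model.generate(**encoding)` and the `tokenizer.decode` call). Below is a minimal, self-contained sketch of how loading and querying the rubentito/t5-base-mpdocvqa checkpoint could look with the standard Transformers T5 classes. The `question` and `context` strings and the `"question: ... context: ..."` prompt format are illustrative assumptions, not taken from the model card.

```python
# Minimal usage sketch (not from the model card): run document QA as
# text-to-text generation with the fine-tuned T5 checkpoint.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("rubentito/t5-base-mpdocvqa")
model = T5ForConditionalGeneration.from_pretrained("rubentito/t5-base-mpdocvqa")

# Hypothetical inputs: in MP-DocVQA the context would come from OCR of the document pages.
question = "What is the invoice date?"
context = "INVOICE\nDate: 12/03/2021\nTotal: 532.00 EUR"

# Assumed prompt template; the exact format used for fine-tuning may differ.
encoding = tokenizer(
    f"question: {question}  context: {context}",
    return_tensors="pt",
    truncation=True,
)

# return_dict_in_generate=True makes generate() return an object with a
# .sequences field, matching the output['sequences'] access in the README snippet.
output = model.generate(**encoding, return_dict_in_generate=True)
answer = tokenizer.decode(output.sequences[0], skip_special_tokens=True)
print(answer)
```

If `return_dict_in_generate` is left at its default, `generate()` returns the generated token ids directly, and the decoding line indexes that tensor instead of `.sequences`.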