jinho8345 commited on
Commit
1a026e5
1 Parent(s): a40ab69

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ This is a converted version of [bros-large-uncased](https://huggingface.co/naver-clova-ocr/bros-large-uncased) with [conversion script](https://github.com/huggingface/transformers/blob/main/src/transformers/models/bros/convert_bros_to_pytorch.py)
2
+
3
+
4
+ # BROS
5
+
6
+ GitHub: https://github.com/clovaai/bros
7
+
8
+ ## Introduction
9
+
10
+ BROS (BERT Relying On Spatiality) is a pre-trained language model focusing on text and layout for better key information extraction from documents.<br>
11
+ Given the OCR results of the document image, which are text and bounding box pairs, it can perform various key information extraction tasks, such as extracting an ordered item list from receipts.<br>
12
+ For more details, please refer to our paper:
13
+
14
+ BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents<br>
15
+ Teakgyu Hong, Donghyun Kim, Mingi Ji, Wonseok Hwang, Daehyun Nam, Sungrae Park<br>
16
+ AAAI 2022 - Main Technical Track
17
+
18
+ [[arXiv]](https://arxiv.org/abs/2108.04539)
19
+
20
+ ## Pre-trained models
21
+ | name | # params | Hugging Face - Models |
22
+ |---------------------|---------:|-------------------------------------------------------------------------------------------------|
23
+ | bros-base-uncased | < 110M | [naver-clova-ocr/bros-base-uncased](https://huggingface.co/naver-clova-ocr/bros-base-uncased) |
24
+ | bros-large-uncased (**this**) | < 340M | [naver-clova-ocr/bros-large-uncased](https://huggingface.co/naver-clova-ocr/bros-large-uncased) |