Update README.md
Browse files
README.md
CHANGED
@@ -22,13 +22,15 @@ On the other hand, there is a significant lack of research focused on extracting
|
|
22 |
|
23 |
LayoutLM [1] is an excellent solution for the problems because, at its core, it is a regular BERT-alike model, but it is uniquely capable of embedding positional information about the text alongside the text itself.
|
24 |
|
25 |
-
We have fine-tuned the model on the DocVQA [2] dataset,
|
26 |
-
|
27 |
-
| Model
|
28 |
-
|
29 |
-
| all-mpnet-base-v2
|
30 |
-
|
|
31 |
-
|
|
|
|
|
|
32 |
|
33 |
### Usage
|
34 |
Please refer to the Colab workbook or the blog post to learn more!
|
|
|
22 |
|
23 |
LayoutLM [1] is an excellent solution for the problems because, at its core, it is a regular BERT-alike model, but it is uniquely capable of embedding positional information about the text alongside the text itself.
|
24 |
|
25 |
+
We have fine-tuned the model on the DocVQA [2] dataset, showing the potential improvement upon the current SOTA [4]:
|
26 |
+
|
27 |
+
| Model | HR@3 | HR@5 | HR@10 |
|
28 |
+
|---------------------------------|----------------|----------------|----------------|
|
29 |
+
| all-mpnet-base-v2 | 0.2500 | 0.2900 | 0.3600 |
|
30 |
+
| gte-base-en-v1.5 | 0.3454 | 0.3899 | 0.4554 |
|
31 |
+
| snowflake-arctic-embed-m-v1.5 | **0.3548** | 0.4042 | 0.4573 |
|
32 |
+
| LayoutLM-Byne (our model) | 0.3491 | **0.4269** | **0.5436** |
|
33 |
+
| Improvement over best competitor| -1.61% | +5.62% | +18.87% |
|
34 |
|
35 |
### Usage
|
36 |
Please refer to the Colab workbook or the blog post to learn more!
|