Boriscii committed on
Commit 76aaefb
1 Parent(s): 5f16ab8

Update README.md

Files changed (1)
  1. README.md +9 -7
README.md CHANGED
@@ -22,13 +22,15 @@ On the other hand, there is a significant lack of research focused on extracting
 
  LayoutLM [1] is an excellent solution for the problems because, at its core, it is a regular BERT-alike model, but it is uniquely capable of embedding positional information about the text alongside the text itself.
 
- We have fine-tuned the model on the DocVQA [2] dataset, far surpassing the current SOTA (all-mpnet-base-v2) [3]:
-
- | Model | HR@3 | HR@5 | HR@10 |
- |-------|------|------|-------|
- | all-mpnet-base-v2 (Baseline) | 0.2505 | 0.2941 | 0.3624 |
- | LayoutLM (Our Model) | 0.3159 | 0.3909 | 0.5019 |
- | Relative Improvement | +26.1% | +32.9% | +38.5% |
+ We have fine-tuned the model on the DocVQA [2] dataset, showing the potential improvement upon the current SOTA [4]:
+
+ | Model | HR@3 | HR@5 | HR@10 |
+ |---------------------------------|----------------|----------------|----------------|
+ | all-mpnet-base-v2 | 0.2500 | 0.2900 | 0.3600 |
+ | gte-base-en-v1.5 | 0.3454 | 0.3899 | 0.4554 |
+ | snowflake-arctic-embed-m-v1.5 | **0.3548** | 0.4042 | 0.4573 |
+ | LayoutLM-Byne (our model) | 0.3491 | **0.4269** | **0.5436** |
+ | Improvement over best competitor| -1.61% | +5.62% | +18.87% |
 
  ### Usage
  Please refer to the Colab workbook or the blog post to learn more!
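
The "Improvement over best competitor" row added in this commit can be checked with a few lines of arithmetic. The sketch below is not part of the commit. The scores are copied from the added table, and the helper name `improvement_over_best` is hypothetical. It assumes the row reports the relative difference between LayoutLM-Byne and the strongest of the other listed models at each cutoff.

```python
# Scores copied from the table added in this commit, ordered (HR@3, HR@5, HR@10).
COMPETITORS = {
    "all-mpnet-base-v2": (0.2500, 0.2900, 0.3600),
    "gte-base-en-v1.5": (0.3454, 0.3899, 0.4554),
    "snowflake-arctic-embed-m-v1.5": (0.3548, 0.4042, 0.4573),
}
OURS = (0.3491, 0.4269, 0.5436)  # LayoutLM-Byne


def improvement_over_best(ours, competitors):
    """Hypothetical helper: percent change versus the best competitor per column.

    Assumption: "best competitor" means the highest score among the other
    listed models, chosen independently for each cutoff.
    """
    result = []
    for i, score in enumerate(ours):
        best = max(row[i] for row in competitors.values())
        result.append(round((score - best) / best * 100, 2))
    return result


if __name__ == "__main__":
    for k, pct in zip((3, 5, 10), improvement_over_best(OURS, COMPETITORS)):
        print(f"HR@{k}: {pct:+.2f}%")
```

Under that assumption the computed values agree with the table: -1.61% at HR@3, +5.62% at HR@5, and +18.87% at HR@10.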