File size: 8,018 Bytes
5e404f1
56939e8
5e404f1
 
d9ad60d
 
5e404f1
 
 
 
 
 
 
56939e8
 
 
 
 
 
 
84e5ba1
56939e8
 
 
 
 
 
4d3f9e4
5e404f1
 
56939e8
3d65ee2
 
 
5e404f1
 
 
56939e8
5e404f1
 
 
56939e8
5e404f1
 
 
 
 
56939e8
5e404f1
 
 
56939e8
84e5ba1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5e404f1
 
 
56939e8
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
---
license: openrail
tags:
- generated_from_trainer
- document-image-binarization
- image-segmentation
model-index:
- name: binarization-segformer-b3
  results: []
---

# binarization-segformer-b3

This model is a fine-tuned version of [nvidia/segformer-b3-1024-1024](https://huggingface.co/nvidia/segformer-b3-finetuned-cityscapes-1024-1024)
on the same ensemble of 13 datasets as the [SauvolaNet](https://arxiv.org/pdf/2105.05521.pdf) work publicly available 
in their GitHub [repository](https://github.com/Leedeng/SauvolaNet#datasets).

It achieves the following results on the evaluation set on DIBCO metrics:
- loss: 0.0743
- DRD: 5.9548
- F-measure: 0.9840
- pseudo F-measure: 0.9740
- PSNR: 16.0119

with PSNR the peak signal-to-noise ratio and DRD the distance reciprocal distortion.

For more information on the above DIBCO metrics, see the 2017 introductory [paper](https://ieeexplore.ieee.org/document/8270159).

## Model description

This model is part of on-going research on pure semantic segmentation models as a formulation of document image binarization (DIBCO).
This is in contrast to the late trend of adapting classical binarization algorithms with neural networks, 
such as [DeepOtsu](https://arxiv.org/abs/1901.06081) or [SauvolaNet](https://arxiv.org/pdf/2105.05521.pdf)
as extensions of Otsu's method and Sauvola thresholding algorithm, respectively.

## Intended uses & limitations

TBC

## Training and evaluation data

TBC

## Training procedure

### Training hyperparameters

TBC

### Training results

| training loss | epoch | step | validation loss | DRD      | F-measure | pseudo F-measure | PSNR    |
|:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:----------------:|:-------:|
| 0.6983        | 0.26  | 10   | 0.7079          | 199.5096 | 0.5945    | 0.5801           | 3.4552  |
| 0.6657        | 0.52  | 20   | 0.6755          | 149.2346 | 0.7006    | 0.6165           | 4.6752  |
| 0.6145        | 0.77  | 30   | 0.6433          | 109.7298 | 0.7831    | 0.6520           | 5.5489  |
| 0.5553        | 1.03  | 40   | 0.5443          | 53.7149  | 0.8952    | 0.8000           | 8.1736  |
| 0.4627        | 1.29  | 50   | 0.4896          | 32.7649  | 0.9321    | 0.8603           | 9.8706  |
| 0.3969        | 1.55  | 60   | 0.4327          | 21.5508  | 0.9526    | 0.8985           | 11.3400 |
| 0.3414        | 1.81  | 70   | 0.3002          | 11.0094  | 0.9732    | 0.9462           | 13.5901 |
| 0.2898        | 2.06  | 80   | 0.2839          | 10.1064  | 0.9748    | 0.9563           | 13.9796 |
| 0.2292        | 2.32  | 90   | 0.2427          | 9.4437   | 0.9761    | 0.9584           | 14.2161 |
| 0.2153        | 2.58  | 100  | 0.2095          | 8.8696   | 0.9771    | 0.9621           | 14.4319 |
| 0.1767        | 2.84  | 110  | 0.1916          | 8.6152   | 0.9776    | 0.9646           | 14.5528 |
| 0.1509        | 3.1   | 120  | 0.1704          | 8.0761   | 0.9791    | 0.9632           | 14.7961 |
| 0.1265        | 3.35  | 130  | 0.1561          | 8.5627   | 0.9784    | 0.9655           | 14.7400 |
| 0.132         | 3.61  | 140  | 0.1318          | 8.1849   | 0.9788    | 0.9670           | 14.8469 |
| 0.1115        | 3.87  | 150  | 0.1317          | 7.8438   | 0.9790    | 0.9657           | 14.9072 |
| 0.0983        | 4.13  | 160  | 0.1273          | 7.9405   | 0.9791    | 0.9673           | 14.9701 |
| 0.1001        | 4.39  | 170  | 0.1234          | 8.4132   | 0.9788    | 0.9691           | 14.8573 |
| 0.0862        | 4.65  | 180  | 0.1147          | 8.0838   | 0.9797    | 0.9678           | 15.0433 |
| 0.0713        | 4.9   | 190  | 0.1134          | 7.6027   | 0.9806    | 0.9687           | 15.2235 |
| 0.0905        | 5.16  | 200  | 0.1061          | 7.2973   | 0.9803    | 0.9699           | 15.1646 |
| 0.0902        | 5.42  | 210  | 0.1061          | 8.4049   | 0.9787    | 0.9699           | 14.8460 |
| 0.0759        | 5.68  | 220  | 0.1062          | 7.7147   | 0.9809    | 0.9695           | 15.2426 |
| 0.0638        | 5.94  | 230  | 0.1019          | 7.7449   | 0.9806    | 0.9695           | 15.2195 |
| 0.0852        | 6.19  | 240  | 0.0962          | 7.0221   | 0.9817    | 0.9693           | 15.4730 |
| 0.0677        | 6.45  | 250  | 0.0961          | 7.2520   | 0.9814    | 0.9710           | 15.3878 |
| 0.0668        | 6.71  | 260  | 0.0972          | 6.6658   | 0.9823    | 0.9689           | 15.6106 |
| 0.0701        | 6.97  | 270  | 0.0909          | 6.9454   | 0.9820    | 0.9713           | 15.5458 |
| 0.0567        | 7.23  | 280  | 0.0925          | 6.5498   | 0.9824    | 0.9718           | 15.5965 |
| 0.0624        | 7.48  | 290  | 0.0899          | 7.3125   | 0.9813    | 0.9717           | 15.3255 |
| 0.0649        | 7.74  | 300  | 0.0932          | 7.4915   | 0.9816    | 0.9684           | 15.5666 |
| 0.0524        | 8.0   | 310  | 0.0905          | 7.1666   | 0.9815    | 0.9711           | 15.4526 |
| 0.0693        | 8.26  | 320  | 0.0901          | 6.5627   | 0.9827    | 0.9704           | 15.7335 |
| 0.0528        | 8.52  | 330  | 0.0845          | 6.6690   | 0.9826    | 0.9734           | 15.5950 |
| 0.0632        | 8.77  | 340  | 0.0822          | 6.2661   | 0.9833    | 0.9723           | 15.8631 |
| 0.0522        | 9.03  | 350  | 0.0844          | 6.0073   | 0.9836    | 0.9715           | 15.9393 |
| 0.0568        | 9.29  | 360  | 0.0817          | 5.9460   | 0.9837    | 0.9721           | 15.9523 |
| 0.057         | 9.55  | 370  | 0.0900          | 7.9726   | 0.9812    | 0.9730           | 15.1229 |
| 0.052         | 9.81  | 380  | 0.0836          | 6.5444   | 0.9822    | 0.9712           | 15.6388 |
| 0.0568        | 10.06 | 390  | 0.0810          | 6.0359   | 0.9836    | 0.9714           | 15.9796 |
| 0.0481        | 10.32 | 400  | 0.0784          | 6.2110   | 0.9835    | 0.9724           | 15.9235 |
| 0.0513        | 10.58 | 410  | 0.0803          | 6.0990   | 0.9835    | 0.9715           | 15.9502 |
| 0.0595        | 10.84 | 420  | 0.0798          | 6.0829   | 0.9835    | 0.9720           | 15.9052 |
| 0.047         | 11.1  | 430  | 0.0779          | 5.8847   | 0.9838    | 0.9725           | 16.0043 |
| 0.0406        | 11.35 | 440  | 0.0802          | 5.7944   | 0.9838    | 0.9713           | 16.0620 |
| 0.0493        | 11.61 | 450  | 0.0781          | 6.0947   | 0.9836    | 0.9731           | 15.9033 |
| 0.064         | 11.87 | 460  | 0.0769          | 6.1257   | 0.9837    | 0.9736           | 15.9080 |
| 0.0622        | 12.13 | 470  | 0.0765          | 6.2964   | 0.9835    | 0.9739           | 15.8188 |
| 0.0457        | 12.39 | 480  | 0.0773          | 5.9826   | 0.9838    | 0.9728           | 16.0119 |
| 0.0447        | 12.65 | 490  | 0.0761          | 5.7977   | 0.9841    | 0.9728           | 16.0900 |
| 0.0515        | 12.9  | 500  | 0.0750          | 5.8569   | 0.9840    | 0.9729           | 16.0633 |
| 0.0357        | 13.16 | 510  | 0.0796          | 5.7990   | 0.9837    | 0.9713           | 16.0818 |
| 0.0503        | 13.42 | 520  | 0.0749          | 5.8323   | 0.9841    | 0.9736           | 16.0510 |
| 0.0508        | 13.68 | 530  | 0.0746          | 6.0361   | 0.9839    | 0.9735           | 15.9709 |
| 0.0533        | 13.94 | 540  | 0.0768          | 6.1596   | 0.9836    | 0.9740           | 15.9193 |
| 0.0503        | 14.19 | 550  | 0.0739          | 5.5900   | 0.9843    | 0.9723           | 16.1883 |
| 0.0515        | 14.45 | 560  | 0.0740          | 5.4660   | 0.9845    | 0.9727           | 16.2745 |
| 0.0502        | 14.71 | 570  | 0.0740          | 5.5895   | 0.9844    | 0.9736           | 16.2054 |
| 0.0401        | 14.97 | 580  | 0.0741          | 5.9694   | 0.9840    | 0.9747           | 15.9603 |
| 0.0495        | 15.23 | 590  | 0.0745          | 5.9136   | 0.9841    | 0.9740           | 16.0458 |
| 0.0413        | 15.48 | 600  | 0.0743          | 5.9548   | 0.9840    | 0.9740           | 16.0119 |

### Framework versions

- transformers 4.31.0
- torch 2.0.0
- datasets 2.13.1
- tokenizers 0.13.3