MikkoLipsanen commited on
Commit
0976156
1 Parent(s): 340cdcb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: image-classification
3
+ ---
4
+ ## Table cell classification
5
+
6
+ The model is trained to classify table cell images as either empty or not empty. It has been trained using
7
+ table cell images from Finnish census and death record tables from the 1930s.
8
+
9
+ The model has been trained using [densenet121](https://pytorch.org/vision/stable/models/generated/torchvision.models.densenet121.html) as the base model.
10
+
11
+
12
+ ## Intended uses & limitations
13
+
14
+ The model has been trained to classify table cells from specific kinds of tables, which contain mainly handwritten text.
15
+ It has not been tested with other type of table cell data.
16
+
17
+ ## Training and validation data
18
+
19
+ Training dataset consisted of
20
+
21
+ - empty cell images: 2943
22
+ - non-empty cell images: 5033
23
+
24
+ Validation dataset consisted of
25
+
26
+ - empty cell images: 367
27
+ - non-empty cell images: 627
28
+
29
+ ## Training procedure
30
+
31
+ The code used for model training is available in the repository in `train.py` file, which uses functions from
32
+ `augment.py` and `utils.py` files. The model was trained using cpu with the following hyperparameters:
33
+
34
+ - image size: 2560
35
+ - learning rate: 0.0001
36
+ - train batch size: 32
37
+ - epochs: 15
38
+ - patience: 3 epochs
39
+ - optimizer: Adam
40
+
41
+ ## Evaluation results
42
+
43
+ Evaluation results using the validation dataset are listed below:
44
+ |Validation loss|Validation accuracy|Validation F1-score
45
+ -|-|-
46
+ 0.0427|0.9899|0.9903
47
+
48
+ ## Inference
49
+ Inference can be performed using the code in the `test.py` file.