MikkoLipsanen committed
Commit: 25dc135
Parent(s): ccdf3e0
Update README.md
README.md CHANGED
@@ -61,14 +61,22 @@ digitized documents from Finnish public administration was also used for model t
 entity classes contained in training, validation and test datasets are listed below:
 
 Number of entity types in the data
-Dataset|
-
-Train|0|0|0|0|0|0|0|0|0|0
-Val|
-Test|
+Dataset|PERSON|ORG|LOC|GPE|PRODUCT|EVENT|DATE|JON|FIBC|NORP
+-|-|-|-|-|-|-|-|-|-|-
+Train|0|0|0|0|0|0|0|0|0|0
+Val|1560|4077|108|1643|880|165|1897|185|265|299
+Test|1284|3742|87|1713|906|137|1864|179|234|261
 
 ## Training procedure
 
 This model was trained using an NVIDIA RTX A6000 GPU with the following hyperparameters:
 
+- learning rate: 2e-05
+- train batch size: 16
+- epochs: 10
+- optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
+- scheduler: linear scheduler with num_warmup_steps=round(len(train_dataloader)/5) and num_training_steps=len(train_dataloader)*epochs
+- maximum length of data sequence: 512
+- patience: 2 epochs
+
 The training code with instructions is available [here](https://github.com/DALAI-hanke/BERT_NER).
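For readability, the hyperparameters added above map onto roughly the following PyTorch + Hugging Face `transformers` setup. This is a minimal sketch, not the project's actual script (that is in the linked repository): `model` and `train_dataset` are stand-in placeholders, since the base checkpoint and data pipeline are not part of this diff.

```python
# Minimal sketch of the listed hyperparameters, assuming a PyTorch +
# Hugging Face `transformers` training loop. `model` and `train_dataset`
# are stand-ins; the real training code is in DALAI-hanke/BERT_NER.
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
from transformers import get_linear_schedule_with_warmup

model = nn.Linear(768, 21)                            # stand-in for the NER model
train_dataset = TensorDataset(torch.zeros(160, 768))  # stand-in for the training data

epochs = 10
max_length = 512   # maximum length of data sequence (tokenizer truncation limit)
patience = 2       # early stopping: halt after 2 epochs without validation improvement

train_dataloader = DataLoader(train_dataset, batch_size=16, shuffle=True)

# optimizer: AdamW with betas=(0.9, 0.999) and epsilon=1e-08
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5,
                              betas=(0.9, 0.999), eps=1e-8)

# scheduler: linear warmup over ~20% of one epoch's steps, then linear decay
scheduler = get_linear_schedule_with_warmup(
    optimizer,
    num_warmup_steps=round(len(train_dataloader) / 5),
    num_training_steps=len(train_dataloader) * epochs,
)
```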