Update README.md
Browse files
README.md
CHANGED
@@ -59,11 +59,11 @@ This will return a list of recognized tokens marked with label 'INSTRUCTION'.
|
|
59 |
## Training
|
60 |
|
61 |
It's based on the transformer architecture and specifically uses the [xlm-roberta-base-uk](https://huggingface.co/ukr-models/xlm-roberta-base-uk) model from `ukr-models`, fine-tuned for the token classification task. The training data was carefully chosen to include a balanced distribution of titles containing instructions and those not containing instructions.
|
62 |
-
The dataset contains newspaper titles (~
|
63 |
|
64 |
## Evaluation
|
65 |
|
66 |
-
Model performance was evaluated using a held-out test set, again consisting of manually labeled newspaper titles. F1 - 0.
|
67 |
|
68 |
## License
|
69 |
|
|
|
59 |
## Training
|
60 |
|
61 |
It's based on the transformer architecture and specifically uses the [xlm-roberta-base-uk](https://huggingface.co/ukr-models/xlm-roberta-base-uk) model from `ukr-models`, fine-tuned for the token classification task. The training data was carefully chosen to include a balanced distribution of titles containing instructions and those not containing instructions.
|
62 |
+
The dataset contains newspaper titles (~6k titles), with tokens representing instructions manually labeled.
|
63 |
|
64 |
## Evaluation
|
65 |
|
66 |
+
Model performance was evaluated using a held-out test set, again consisting of manually labeled newspaper titles. F1 - 0.9601, ACCURACY - 0.9968 for the 'INSTRUCTION' label
|
67 |
|
68 |
## License
|
69 |
|