|
--- |
|
tags: |
|
- spacy |
|
- token-classification |
|
language: uk |
|
datasets: |
|
- ner-uk |
|
license: mit |
|
model-index: |
|
- name: uk_ner_web_trf_large |
|
results: |
|
- task: |
|
name: NER |
|
type: token-classification |
|
metrics: |
|
- name: NER Precision |
|
type: precision |
|
value: 0.9183514774 |
|
- name: NER Recall |
|
type: recall |
|
value: 0.915503876 |
|
- name: NER F Score |
|
type: f_score |
|
value: 0.9169254658 |
|
widget: |
|
- text: "Президент Володимир Зеленський пояснив, що наразі діалог із режимом Володимира путіна неможливий, адже агресор обрав курс на знищення українського народу. За словами Зеленського цей режим РФ виявляє неповагу до суверенітету і територіальної цілісності України." |
|
--- |
|
# uk_ner_web_trf_large |
|
|
|
## Model description |
|
|
|
**uk_ner_web_trf_large** is a fine-tuned [XLM-Roberta model](https://huggingface.co/xlm-roberta-large) that is ready to use for **Named Entity Recognition** and achieves a **SoA** performance for the NER task for Ukrainian language. It outperforms another SpaCy model, [uk_core_news_trf](https://huggingface.co/ukr-models/uk_core_news_trf) on a NER task. |
|
|
|
It has been trained to recognize four types of entities: location (LOC), organizations (ORG), person (PERS) and Miscellaneous (MISC). |
|
|
|
The model was fine-tuned on the [NER-UK dataset](https://github.com/lang-uk/ner-uk), released by the [lang-uk](https://lang.org.ua). |
|
Smaller transformer based model for the SpaCy is available [here](https://huggingface.co/dchaplinsky/uk_ner_web_trf_base). |
|
|
|
|
|
Copyright: [Dmytro Chaplynskyi](https://twitter.com/dchaplinsky), [lang-uk project](https://lang.org.ua), 2022 |
|
|