File size: 649 Bytes
031049e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---

language:
- bg
metrics:
- f1
- accuracy
- precision
- recall
base_model:
- google-bert/bert-base-uncased
pipeline_tag: text-classification
---

Toxic language classification model of Bulgarian language, based on the [bert-base-bg](https://huggingface.co/rmihaylov/bert-base-bg) model. 

The model classifies between 4 classes: Toxic, MedicalTerminology, NonToxic, MinorityGroup. 

Classification report: 

| Accuracy | Precision | Recall | F1 Score | Loss Function |
|----------|-----------|--------|----------|---------------|
| 0.85     | 0.86      | 0.85   | 0.85     | 0.43          |

More information in the paper (coming soon).