patrickquick
commited on
Commit
•
5489177
1
Parent(s):
0682b0c
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,26 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- en
|
4 |
+
tags:
|
5 |
+
- BERTicelli
|
6 |
+
- exbert
|
7 |
+
- text classification
|
8 |
+
- abusive language
|
9 |
+
- hate speech
|
10 |
+
- offensive language
|
11 |
+
license: apache-2.0
|
12 |
+
datasets:
|
13 |
+
- bookcorpus
|
14 |
+
- wikipedia
|
15 |
+
- OLID v1.0
|
16 |
+
---
|
17 |
+
|
18 |
+
[Mona Allaert](https://github.com/MonaDT) •
|
19 |
+
[Leonardo Grotti](https://github.com/corvusMidnight) •
|
20 |
+
[Patrick Quick](https://github.com/patrickquick)
|
21 |
+
|
22 |
+
## Model description
|
23 |
+
|
24 |
+
BERTicelli is an English pre-trained BERT model obtained by further training the [English BERT base cased model](https://github.com/google-research/bert) with the training data from [Offensive Language Identification Dataset (OLID)](https://scholar.harvard.edu/malmasi/olid).
|
25 |
+
|
26 |
+
This model was developed for the NLP Shared Task in the Digital Text Analysis program at the University of Antwerp (2021–2022).
|