kold_binary / README.md
JunHwi's picture
Create README.md
a6e45cd
|
raw
history blame
328 Bytes

Pretraining KoLD Dataset with pretrained "koelectra-v3" model.

dataset : https://github.com/boychaboy/KOLD

pretrained_model : https://huggingface.co/monologg/koelectra-base-v3-discriminator

So you should use tokenizer with "koelectra-base-v3-discriminator".

label maps are like

{0: "not_hate_speech", 1: "hate_speech"}