Pretraining KoLD Dataset with pretrained "koelectra-v3" model. dataset : https://github.com/boychaboy/KOLD pretrained_model : https://huggingface.co/monologg/koelectra-base-v3-discriminator So you should use tokenizer with "koelectra-base-v3-discriminator". label maps are like > {0: "not_hate_speech", 1: "hate_speech"}