metadata
license: apache-2.0
language:
  - en
pipeline_tag: text-classification

DeTexD-RoBERTa-base delicate text detection

This is a baseline RoBERTa-base model for the delicate text detection task.

The meaning of each label, according to the paper:

  • LABEL_0 -> non-delicate (0)
  • LABEL_1 -> very low risk (1)
  • LABEL_2 -> low risk (2)
  • LABEL_3 -> medium risk (3)
  • LABEL_4 -> high risk (4)
  • LABEL_5 -> very high risk (5)
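Downstream code may find it convenient to translate the raw label strings into the names listed above. The following is a minimal sketch; `LABEL_NAMES` and `describe_label` are hypothetical names introduced here for illustration and are not part of the model or of the transformers API.

```python
from typing import Tuple

# Hypothetical lookup table: raw label string -> risk name from the list above.
LABEL_NAMES = {
    "LABEL_0": "non-delicate",
    "LABEL_1": "very low risk",
    "LABEL_2": "low risk",
    "LABEL_3": "medium risk",
    "LABEL_4": "high risk",
    "LABEL_5": "very high risk",
}


def describe_label(label: str) -> Tuple[int, str]:
    """Return (risk level, readable name) for a raw label such as 'LABEL_3'."""
    # The numeric risk level is the suffix after the last underscore.
    level = int(label.rsplit("_", 1)[1])
    return level, LABEL_NAMES[label]
```

For example, `describe_label("LABEL_3")` returns `(3, "medium risk")`.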

Classification example code

Here's a short usage example with the transformers library for a binary classification task:

from transformers import pipeline

classifier = pipeline("text-classification", model="grammarly/detexd-roberta-base")

def predict_binary_score(text: str):
    # get multiclass probability scores
    scores = classifier(text, top_k=None)

    # convert to a single score by summing the probability scores
    # for the higher-index classes
    return sum(score['score']
               for score in scores
               if score['label'] in ('LABEL_3', 'LABEL_4', 'LABEL_5'))

def predict_delicate(text: str, threshold=0.72496545):
    return predict_binary_score(text) > threshold

print(predict_delicate("Time flies like an arrow. Fruit flies like a banana."))

Expected output:

False
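The aggregation step can also be checked in isolation, without downloading the model. The sketch below applies the same sum-and-threshold logic to a hand-written score list. It assumes the list-of-dicts shape that the example above iterates over; the probabilities are made up for illustration and are not real model output, and `binary_score`, `DELICATE_LABELS`, and `example_scores` are hypothetical names.

```python
# Labels counted as delicate in the binary setting (medium risk and above).
DELICATE_LABELS = ("LABEL_3", "LABEL_4", "LABEL_5")

# Default threshold taken from the usage example above.
THRESHOLD = 0.72496545


def binary_score(scores):
    """Sum the probabilities assigned to the delicate labels."""
    return sum(item["score"] for item in scores if item["label"] in DELICATE_LABELS)


# Made-up probabilities for illustration only; not real model output.
example_scores = [
    {"label": "LABEL_0", "score": 0.50},
    {"label": "LABEL_1", "score": 0.20},
    {"label": "LABEL_2", "score": 0.10},
    {"label": "LABEL_3", "score": 0.10},
    {"label": "LABEL_4", "score": 0.05},
    {"label": "LABEL_5", "score": 0.05},
]

score = binary_score(example_scores)
is_delicate = score > THRESHOLD
print(is_delicate)  # the delicate labels sum to roughly 0.2, below the threshold
```

With these illustrative numbers most of the probability mass sits on the low-risk labels, so the text would be classified as non-delicate.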

Citation Information

@inproceedings{chernodub-etal-2023-detexd,
    title = "{D}e{T}ex{D}: A Benchmark Dataset for Delicate Text Detection",
    author = "Yavnyi, Serhii and Sliusarenko, Oleksii and Razzaghi, Jade and Mo, Yichen and Hovakimyan, Knar and Chernodub, Artem",
    booktitle = "The 7th Workshop on Online Abuse and Harms (WOAH)",
    month = jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.woah-1.2",
    pages = "14--28",
    abstract = "Over the past few years, much research has been conducted to identify and regulate toxic language. However, few studies have addressed a broader range of sensitive texts that are not necessarily overtly toxic. In this paper, we introduce and define a new category of sensitive text called {``}delicate text.{''} We provide the taxonomy of delicate text and present a detailed annotation scheme. We annotate DeTexD, the first benchmark dataset for delicate text detection. The significance of the difference in the definitions is highlighted by the relative performance deltas between models trained each definitions and corpora and evaluated on the other. We make publicly available the DeTexD Benchmark dataset, annotation guidelines, and baseline model for delicate text detection.",
}