bgonzalezbustamante/ft-roberta-toxicity

Text Classification


bgonzalezbustamante committed 21 days ago

Commit 11ac6c9 • 1 Parent(s): 0d84fdf

Update README.md

Files changed (1)

README.md +6 -8


@@ -21,8 +21,8 @@ This is a fine-tuned roBERTa-base model trained using as a base model Twitter-ro
 The dataset comprises almost 5M data points from three Latin American protest events: (a) protests against the coronavirus and judicial reform measures in Argentina during August 2020; (b) protests against education budget cuts in Brazil in May 2019; and (c) the social outburst in Chile stemming from protests against the underground fare hike in October 2019. We are focusing on interactions in Spanish to elaborate a gold standard for digital interactions in this language, therefore, we prioritise Argentinian and Chilean data.
 - [GitHub repository](https://github.com/training-datalab/gold-standard-toxicity).
-- [Dataset on Zenodo](https://zenodo.org/doi/10.5281/zenodo.12574288).
-- [Reference paper](https://doi.org/10.48550/arXiv.2409.09741).
 **Labels: NONTOXIC and TOXIC.**
@@ -32,12 +32,10 @@ WIP
 ## Validation Metrics
-| Metric | Value  |
-|---|---|
-| Accuracy  | 0.790 |
-| Precision | 0.920 |
-| Reccall  | 0.657 |
-| F1-Score  | 0.767 |
 ## License

 The dataset comprises almost 5M data points from three Latin American protest events: (a) protests against the coronavirus and judicial reform measures in Argentina during August 2020; (b) protests against education budget cuts in Brazil in May 2019; and (c) the social outburst in Chile stemming from protests against the underground fare hike in October 2019. We are focusing on interactions in Spanish to elaborate a gold standard for digital interactions in this language, therefore, we prioritise Argentinian and Chilean data.
 - [GitHub repository](https://github.com/training-datalab/gold-standard-toxicity).
+- [Dataset on Zenodo](zenodo.org/doi/10.5281/zenodo.12574288).
+- [Reference paper](arxiv.org/abs/2409.09741)
 **Labels: NONTOXIC and TOXIC.**
 ## Validation Metrics
+- Accuracy: 0.790
+- Precision: 0.920
+- Reccall: 0.657
+- F1-Score: 0.767
 ## License
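
The validation metrics in this diff can be sanity-checked against each other, since an F1-score is the harmonic mean of precision and recall. The snippet below is a minimal sketch, assuming the reported values are rounded to three decimals and that all three figures refer to the same class under the standard binary definition (the README does not state this). The helper name `f1_score` is introduced here for illustration only.

```python
# Values copied from the "Validation Metrics" section of the diff.
precision = 0.920
recall = 0.657
reported_f1 = 0.767


def f1_score(p: float, r: float) -> float:
    """Return the harmonic mean of precision p and recall r."""
    if p + r == 0:
        return 0.0
    return 2 * p * r / (p + r)


computed_f1 = f1_score(precision, recall)

# The README reports three decimals, so compare at that precision.
is_consistent = abs(round(computed_f1, 3) - reported_f1) < 1e-9

print(f"computed F1 = {computed_f1:.4f}, consistent with README: {is_consistent}")
```

Under these assumptions the reported F1-score agrees with the reported precision and recall.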