l3cube-pune's picture
Update README.md
74ea6ad verified
|
raw
history blame
1.81 kB
metadata
language: mr
tags:
  - bert
license: cc-by-4.0
datasets:
  - L3Cube-MahaNews-SHC
widget:
  - text: >-
      IND vs IRE : आयर्लंडच्या दौऱ्यासाठी कसा आहे भारतीय संघ, जाणून घ्या कोणाला
      मिळाली संधी...

MahaNews-SHC-BERT

MahaNews-SHC-BERT is a MahaBERT(l3cube-pune/marathi-bert-v2) model fine-tuned on full L3Cube-MahaNews-SHC Corpus, a Marathi short text / news headlines classification dataset.
It is a topic identification cum short text classification model with 12 output categories
[dataset link] (https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our [paper] (coming soon)
Citing:

@inproceedings{mittal2023l3cube,
  title={L3Cube-MahaNews: News-Based Short Text and Long Document Classification Datasets in Marathi},
  author={Mittal, Saloni and Magdum, Vidula and Hiwarkhedkar, Sharayu and Dhekane, Omkar and Joshi, Raviraj},
  booktitle={International Conference on Speech and Language Technologies for Low-resource Languages},
  pages={52--63},
  year={2023},
  organization={Springer}
}

Other Marathi Sentiment models from MahaNews family are shared here:

MahaNews-LDC-BERT (long documents)
MahaNews-SHC-BERT (short text)
MahaNews-LPC-BERT (medium paragraphs)
MahaNews-All-BERT (all document lengths)