azminetoushikwasi commited on
Commit
796b53e
1 Parent(s): 85c036a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -0
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - bn
5
+ base_model:
6
+ - sagorsarker/bangla-bert-base
7
+ pipeline_tag: text-classification
8
+ datasets:
9
+ - ciol-research/Dhoroni
10
+ metrics:
11
+ - accuracy
12
+ - precision
13
+ - recall
14
+ - f1
15
+ library_name: transformers
16
+ tags:
17
+ - Environment
18
+ - Climate Change
19
+ - News and Media
20
+ - Bangla
21
+ - Bengali
22
+ ---
23
+
24
+ ## *Dhoroni*: Exploring Bengali Climate Change and Environmental Views with a Multi-Perspective News Dataset and Natural Language Processing
25
+ - **Authors**: Azmine Toushik Wasi, Wahid Faisal, Taj Ahmad, Abdur Rahman, Mst Rafia Islam
26
+ - **Dataset DOI (Zenodo)**: https://doi.org/10.5281/zenodo.13695110
27
+ - **Task in this model**: News Authenticity Identification
28
+ - **Abstract**: Climate change poses critical challenges globally, disproportionately affecting low-income countries that often lack resources and linguistic representation on the international stage. Despite Bangladesh's status as one of the most vulnerable nations to climate impacts, research gaps persist in Bengali-language studies related to climate change and NLP. To address this disparity, we introduce ধরণী (*Dhoroni*), a novel Bengali (Bangla) climate change and environmental news dataset, comprising a 2300 annotated Bangla news articles, offering multiple perspectives such as political influence, scientific/statistical data, authenticity, stance detection, and stakeholder involvement. Furthermore, we present an in-depth exploratory analysis of *Dhoroni* and introduce *BanglaBERT-Dhoroni* family, a novel baseline family for climate stance detection in Bangla, fine-tuned on our dataset. This research contributes significantly to enhancing accessibility and analysis of climate discourse in Bengali (Bangla), addressing crucial communication and research gaps in climate-impacted regions like Bangladesh with 180 million people.
29
+
30
+
31
+ ### Highlights
32
+ - We introduce Dhoroni, a novel benchmark dataset with 2,300 Bengali news articles.
33
+ - The dataset is annotated by three annotators across ten different perspectives.
34
+ - Detailed exploratory analysis and reasoning for each perspective are provided.
35
+ - We present ten baseline models under BanglaBERT-Dhoroni family for different tasks.
36
+ - Benchmarking scores show stable performance in different task