Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions
Abstract
Bias assessment of news sources is paramount for professionals, organizations, and researchers who rely on truthful evidence for information gathering and reporting. While certain bias indicators are discernible from content analysis, descriptors like political bias and fake news pose greater challenges. In this paper, we propose an extension to a recently presented news media reliability estimation method that focuses on modeling outlets and their longitudinal web interactions. Concretely, we assess the classification performance of four reinforcement learning strategies on a large news media hyperlink graph. Our experiments, targeting two challenging bias descriptors, factual reporting and political bias, showed a significant performance improvement at the source media level. Additionally, we validate our methods on the CLEF 2023 CheckThat! Lab challenge, outperforming the reported results in both F1-score and the official MAE metric. Furthermore, we contribute by releasing the largest annotated dataset of news source media, categorized with factual reporting and political bias labels. Our findings suggest that profiling news media sources based on their hyperlink interactions over time is feasible, offering a bird's-eye view of evolving media landscapes.
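The abstract reports results in terms of F1-score and MAE. As a rough illustration of how these two metrics could be computed for source-level factual reporting labels, here is a minimal Python sketch. The label names (`low`, `mixed`, `high`), their ordinal encoding, the use of macro averaging for F1, and the function names are all assumptions made for this example; they are not taken from the paper or the challenge definition.

```python
# Assumed ordinal encoding of factual-reporting labels (hypothetical names).
FACTUALITY_ORDER = {"low": 0, "mixed": 1, "high": 2}


def mean_absolute_error(gold, pred, order=FACTUALITY_ORDER):
    """Average ordinal distance between gold and predicted labels."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and of equal length")
    return sum(abs(order[g] - order[p]) for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and of equal length")
    scores = []
    for label in sorted(set(gold) | set(pred)):
        tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
        fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
        fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
        denom = 2 * tp + fp + fn
        # A class with no true positives, false positives, or false negatives
        # cannot occur here, but guard against division by zero anyway.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

MAE penalizes a `low` versus `high` confusion twice as much as a `low` versus `mixed` one, whereas F1 treats every misclassification alike, which is one reason to report both.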
Community
This paper may be of interest to those working in the information verification field (fake news detection, fact-checking, etc.). A dataset annotated at the news source level is released along with the paper. More precisely, alongside the domain name of each news media outlet, political bias and factual reporting labels are provided.
The dataset is available via Hugging Face here.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- CrediRAG: Network-Augmented Credibility-Based Retrieval for Misinformation Detection in Reddit (2024)
- E2MoCase: A Dataset for Emotional, Event and Moral Observations in News Articles on High-impact Legal Cases (2024)
- Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models (2024)
- Detection of Human and Machine-Authored Fake News in Urdu (2024)
- Enriching GNNs with Text Contextual Representations for Detecting Disinformation Campaigns on Social Media (2024)