We Can Detect Your Bias: Predicting the Political Ideology of News Articles

We explore the task of predicting the leading political ideology or bias of news articles. First, we collect and release a large dataset of 34,737 articles that were manually annotated for political ideology (left, center, or right); the dataset is well balanced across both topics and media. We further use a challenging experimental setup in which the test examples come from media that were not seen during training, which prevents the model from learning to detect the source of the target news article instead of predicting its political ideology. From a modeling perspective, we propose adversarial media adaptation, as well as a specially adapted triplet loss. We further add background information about the source, and we show that it helps improve article-level prediction. Our experimental results show sizable improvements over state-of-the-art pre-trained Transformers in this challenging setup.

Baly, Ramy;Da San Martino, Giovanni;Glass, James;Nakov, Preslav

2020

Year	2020
Book title	Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Conference title	2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
DOI	https://dx.doi.org/10.18653/v1/2020.emnlp-main.404
WOS ID	WOS:000855160705014
Scopus ID	2-s2.0-85108584483
OpenAlex ID	W3101295217
ISBN	9781952148606
Appears in types	04.01 - Conference proceedings contribution

Files in this item:

File	Access	Type	License	Size	Format
2020.emnlp-main.404.pdf	Open access	Published (Publisher's Version of Record)	Creative Commons	1.31 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11577/3368821
