Automatic Identification of Political Ideology in Online News Articles
Vikki Fiore ()
Additional contact information
Vikki Fiore: Montclair State University, Montclair, NJ, USA
RAIS Journal for Social Sciences, 2019, vol. 3, issue 2, 50-54
Abstract:
The following research discusses text analysis approaches to automatically categorize news articles based on their political ideology. In this case, ideology is defined as a writer expressing either a liberal or a conservative point of view. This classification is done at both the document and the phrase level, as previous research has indicated that doing so increases classifier performance over using a “bag of words†approach. Linguistic features related to lexical richness are extracted from the articles via Python, and features related to emotions and values are extracted via the Linguistic Inquiry and Word Count software. The machine learning software Weka is then used to apply various classification algorithms on the numeric features. Additionally, Amazon Mechanical Turk is used to measure human accuracy and inter-rater agreement on identifying the ideology of the same texts. In all, the trained classifiers perform well above the baseline and outperform the human annotators on the same tasks.
Keywords: text analysis; natural language processing; applied linguistics; politics; bias detection; machine learning; artificial intelligence (search for similar items in EconPapers)
Date: 2019
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://journal.rais.education/index.php/raiss/article/view/75/61 (application/pdf)
http://journal.rais.education/index.php/raiss/article/view/75 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:smo:jornl1:v:3:y:2019:i:2:p:50-54
Access Statistics for this article
More articles in RAIS Journal for Social Sciences from Research Association for Interdisciplinary Studies
Bibliographic data for series maintained by Eduard David ().