Beyond a Bag of Words: Using PULSAR to Extract Judgments on Specific Human Rights at Scale
Park Baekkwan,
Colaresi Michael () and
Greene Kevin
Additional contact information
Park Baekkwan: University of Pittsburgh, Department of Political Science, Pittsburgh, United States of America
Colaresi Michael: University of Pittsburgh, Department of Political Science, Pittsburgh, United States of America
Greene Kevin: University of Pittsburgh, Department of Political Science, Pittsburgh, United States of America
Peace Economics, Peace Science, and Public Policy, 2018, vol. 24, issue 4, 7
Abstract:
Sentiment, judgments and expressed positions are crucial concepts across international relations and the social sciences more generally. Yet, contemporary quantitative research has conventionally avoided the most direct and nuanced source of this information: political and social texts. In contrast, qualitative research has long relied on the patterns in texts to understand detailed trends in public opinion, social issues, the terms of international alliances, and the positions of politicians. Yet, qualitative human reading does not scale to the accelerating mass of digital information available currently. Researchers are in need of automated tools that can extract meaningful opinions and judgments from texts. Thus, there is an emerging opportunity to marry the model-based, inferential focus of quantitative methodology, as exemplified by ideal point models, with high resolution, qualitative interpretations of language and positions. We suggest that using alternatives to simple bag of words (BOW) representations and re-focusing on aspect-sentiment representations of text will aid researchers in systematically extracting people’s judgments and what is being judged at scale. The experimental results below show that our approach which automates the extraction of aspect and sentiment MWE pairs, outperforms BOW in classification tasks, while providing more interpretable parameters. By connecting expressed sentiment and the aspects being judged, PULSAR (Parsing Unstructured Language into Sentiment-Aspect Representations) also has deep implications for understanding the underlying dimensionality of issue positions and ideal points estimated with text. Our approach to parsing text into aspects-sentiment expressions recovers both expressive phrases (akin to categorical votes), as well as the aspects that are being judged (akin to bills). Thus, PULSAR or future systems like it, open up new avenues for the systematic analysis of high-dimensional opinions and judgments at scale within existing ideal point models.
Keywords: text analysis; Human Rights; Natural Language Processing (search for similar items in EconPapers)
Date: 2018
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1515/peps-2018-0030 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:pepspp:v:24:y:2018:i:4:p:7:n:1
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/peps/html
DOI: 10.1515/peps-2018-0030
Access Statistics for this article
Peace Economics, Peace Science, and Public Policy is currently edited by Raul Caruso
More articles in Peace Economics, Peace Science, and Public Policy from De Gruyter
Bibliographic data for series maintained by Peter Golla ().