Multi-Label Prediction for Political Text-as-Data

Erlich, Aaron; Dantas, Stefano G.; Bagozzi, Benjamin E.; Berliner, Daniel; Palmer-Rubin, Brian

Political Analysis, 2022, vol. 30, issue 4, 463-480

Abstract: Political scientists increasingly use supervised machine learning to code multiple relevant labels from a single set of texts. The current “best practice” of individually applying supervised machine learning to each label ignores information on inter-label association(s), and is likely to under-perform as a result. We introduce multi-label prediction as a solution to this problem. After reviewing the multi-label prediction framework, we apply it to code multiple features of (i) access to information requests made to the Mexican government and (ii) country-year human rights reports. We find that multi-label prediction outperforms standard supervised learning approaches, even in instances where the correlations among one’s multiple labels are low.

Date: 2022

Downloads: (external link)
https://www.cambridge.org/core/product/identifier/ ... type/journal_article link to article abstract page (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:cup:polals:v:30:y:2022:i:4:p:463-480_1

Political Analysis is published by Cambridge University Press, UPH, Shaftesbury Road, Cambridge CB2 8BS UK.