EconPapers    
Economics at your fingertips  
 

‘Super-Unsupervised’ Classification for Labelling Text: Online Political Hostility as an Illustration

Stig Hebbelstrup Rye Rasmussen, Alexander Bor, Mathias Osmundsen and Michael Bang Petersen

British Journal of Political Science, 2024, vol. 54, issue 1, 179-200

Abstract: We live in a world of text. Yet the sheer magnitude of social media data, coupled with a need to measure complex psychological constructs, has made this important source of data difficult to use. Researchers often engage in costly hand coding of thousands of texts using supervised techniques or rely on unsupervised techniques where the measurement of predefined constructs is difficult. We propose a novel approach that we call ‘super-unsupervised’ learning and demonstrate its usefulness by measuring the psychologically complex construct of online political hostility based on a large corpus of tweets. This approach accomplishes the feat by combining the best features of supervised and unsupervised learning techniques: measurements of complex psychological constructs without a single labelled data source. We first outline the approach before conducting a diverse series of tests that include: (i) face validity, (ii) convergent and discriminant validity, (iii) criterion validity, (iv) external validity, and (v) ecological validity.

Date: 2024
References: Add references at CitEc
Citations: Track citations by RSS feed

Downloads: (external link)
https://www.cambridge.org/core/product/identifier/ ... type/journal_article link to article abstract page (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:cup:bjposi:v:54:y:2024:i:1:p:179-200_9

Access Statistics for this article

More articles in British Journal of Political Science from Cambridge University Press Cambridge University Press, UPH, Shaftesbury Road, Cambridge CB2 8BS UK.
Bibliographic data for series maintained by Kirk Stebbing (csjnls@cambridge.org).

 
Page updated 2023-12-28
Handle: RePEc:cup:bjposi:v:54:y:2024:i:1:p:179-200_9