Bio, psycho, or social: supervised machine learning to classify discursive framing of depression in online health communities
Renáta Németh (),
Fanni Máté,
Eszter Katona,
Márton Rakovics and
Domonkos Sik
Additional contact information
Renáta Németh: Eötvös Loránd University of Budapest
Fanni Máté: Eötvös Loránd University of Budapest
Eszter Katona: Eötvös Loránd University of Budapest
Márton Rakovics: Eötvös Loránd University of Budapest
Domonkos Sik: Eötvös Loránd University of Budapest
Quality & Quantity: International Journal of Methodology, 2022, vol. 56, issue 6, No 3, 3933-3955
Abstract:
Abstract Supervised machine learning on textual data has successful industrial/business applications, but it is an open question whether it can be utilized in social knowledge building outside the scope of hermeneutically more trivial cases. Combining sociology and data science raises several methodological and epistemological questions. In our study the discursive framing of depression is explored in online health communities. Three discursive frameworks are introduced: the bio-medical, psychological, and social framings of depression. ~80 000 posts were collected, and a sample of them was manually classified. Conventional bag-of-words models, Gradient Boosting Machine, word-embedding-based models and a state-of-the-art Transformer-based model with transfer learning, called DistilBERT were applied to expand this classification on the whole database. According to our experience ‘discursive framing’ proves to be a complex and hermeneutically difficult concept, which affects the degree of both inter-annotator agreement and predictive performance. Our finding confirms that the level of inter-annotator disagreement provides a good estimate for the objective difficulty of the classification. By identifying the most important terms, we also interpreted the classification algorithms, which is of great importance in social sciences. We are convinced that machine learning techniques can extend the horizon of qualitative text analysis. Our paper supports a smooth fit of the new techniques into the traditional toolbox of social sciences.
Keywords: Supervised machine learning; Online forums; Natural language processing; Framing of depression; Gradient boosting machine; Transformer-based model (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://link.springer.com/10.1007/s11135-021-01299-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:qualqt:v:56:y:2022:i:6:d:10.1007_s11135-021-01299-0
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11135
DOI: 10.1007/s11135-021-01299-0
Access Statistics for this article
Quality & Quantity: International Journal of Methodology is currently edited by Vittorio Capecchi
More articles in Quality & Quantity: International Journal of Methodology from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().