A Framework for using Machine Learning to Support Qualitative Data Coding
Peter Baumgartner,
Amanda Smith,
Murrey Olmsted and
Dawn Ohse
Additional contact information
Peter Baumgartner: RTI International
No fueyj, OSF Preprints from Center for Open Science
Abstract:
Open-ended survey questions provide qualitative data that are useful for a multitude of reasons. However, qualitative data analysis is labor intensive, and researchers often lack the needed time and resources resulting in underutilization of qualitative data. In attempting to address these issues, we looked to machine learning and recent advances in language models and transfer learning to assist in qualitative coding of responses. We trained a machine learning model following the BERT architecture to predict thematic codes that were then adjudicated by human coders. Results suggest this is a promising approach that can be used to support traditional coding methods and has the potential to alleviate some of the burden associated with qualitative data analysis.
Date: 2021-11-16
New Economics Papers: this item is included in nep-ban, nep-big and nep-cmp
References: Add references at CitEc
Citations:
Downloads: (external link)
https://osf.io/download/6193ab48fe675604bd670b01/
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:osf:osfxxx:fueyj
DOI: 10.31219/osf.io/fueyj
Access Statistics for this paper
More papers in OSF Preprints from Center for Open Science
Bibliographic data for series maintained by OSF ().