EconPapers    
Economics at your fingertips  
 

Supervised Classification of Healthcare Text Data Based on Context-Defined Categories

Sergio Bolívar, Alicia Nieto-Reyes and Heather L. Rogers
Additional contact information
Sergio Bolívar: Department of Mathematics, Statistics and Computer Science, Universidad de Cantabria, 39005 Santander, Spain
Alicia Nieto-Reyes: Department of Mathematics, Statistics and Computer Science, Universidad de Cantabria, 39005 Santander, Spain
Heather L. Rogers: Biocruces Bizkaia Health Research Institute, 48903 Barakaldo, Spain

Mathematics, 2022, vol. 10, issue 12, 1-31

Abstract: Achieving a good success rate in supervised classification analysis of a text dataset, where the relationship between the text and its label can be extracted from the context, but not from isolated words in the text, is still an important challenge facing the fields of statistics and machine learning. For this purpose, we present a novel mathematical framework. We then conduct a comparative study between established classification methods for the case where the relationship between the text and the corresponding label is clearly depicted by specific words in the text. In particular, we use logistic LASSO, artificial neural networks, support vector machines, and decision-tree-like procedures. This methodology is applied to a real case study involving mapping Consolidated Framework for Implementation and Research (CFIR) constructs to health-related text data and achieves a prediction success rate of over 80% when just the first 55% of the text, or more, is used for training and the remaining for testing. The results indicate that the methodology can be useful to accelerate the CFIR coding process.

Keywords: artificial neural networks; decision tree; logistic LASSO; natural language processing; qualitative data; supervised classification; support vector machines; text data analysis (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/12/2005/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/12/2005/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:12:p:2005-:d:835883

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:10:y:2022:i:12:p:2005-:d:835883