Nonlinear gradient-based feature selection for precise prediction of diseases

Sadaf Kabir and Leily Farrokhvar

International Journal of Data Mining, Modelling and Management, 2022, vol. 14, issue 3, 248-268

Abstract: Developing accurate predictive models can profoundly help healthcare providers improve the quality of their services. However, medical data often contain several variables, and not all the data equally contribute towards the prediction. The existence of irrelevant and redundant features in a dataset can unnecessarily increase computational cost and complexity while deteriorating the performance of the predictive model. In this study, we employ the gradient-based prediction attribution as a general tool to identify important features in differentiable predictive models, such as neural networks (NN) and linear regression. Built upon this approach, we analyse single-stage and multi-stage scenarios for feature selection using ten medical datasets. Through extensive experiments, we demonstrate that the combination of the gradient-based approach with NN provides a powerful nonlinear technique to identify important features contributing to the prediction. In particular, nonlinear gradient-based feature selection achieves competitive results or significant improvements over previously reported results on all datasets.
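
The abstract does not give implementation details, so the following is only a minimal sketch of what gradient-based attribution for feature selection could look like, not the authors' method. It assumes a small one-hidden-layer network trained with plain gradient descent on synthetic data, and it scores each feature by the mean absolute gradient of the predicted probability with respect to that input. All names (`train_mlp`, `input_gradients`, `feature_scores`), the data, and the hyperparameters are hypothetical.

```python
import numpy as np

def sigmoid(z):
    # Clip the logit to avoid overflow in exp for very confident predictions.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))

def train_mlp(X, y, hidden=8, lr=0.5, steps=2000, seed=0):
    """Train a tanh MLP with one hidden layer by full-batch gradient descent."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = 0.1 * rng.standard_normal((d, hidden))
    b1 = np.zeros(hidden)
    w2 = 0.1 * rng.standard_normal(hidden)
    b2 = 0.0
    for _ in range(steps):
        H = np.tanh(X @ W1 + b1)               # hidden activations, (n, hidden)
        p = sigmoid(H @ w2 + b2)               # predicted probabilities, (n,)
        dz = (p - y) / n                       # d(mean cross-entropy)/d(logit)
        dH = np.outer(dz, w2) * (1.0 - H**2)   # backprop through tanh
        W1 -= lr * (X.T @ dH)
        b1 -= lr * dH.sum(axis=0)
        w2 -= lr * (H.T @ dz)
        b2 -= lr * dz.sum()
    return W1, b1, w2, b2

def input_gradients(params, X):
    """Gradient of the predicted probability with respect to each input feature."""
    W1, b1, w2, b2 = params
    H = np.tanh(X @ W1 + b1)
    p = sigmoid(H @ w2 + b2)
    dlogit_dH = w2 * (1.0 - H**2)              # (n, hidden)
    dlogit_dX = dlogit_dH @ W1.T               # chain rule back to inputs, (n, d)
    return (p * (1.0 - p))[:, None] * dlogit_dX

def feature_scores(params, X):
    """One importance score per feature: mean absolute input gradient."""
    return np.abs(input_gradients(params, X)).mean(axis=0)

# Synthetic data (an assumption for illustration): only features 0 and 1
# influence the label; features 2..5 are pure noise.
rng = np.random.default_rng(42)
X = rng.standard_normal((600, 6))
y = (X[:, 0] + X[:, 1] ** 3 > 0).astype(float)

params = train_mlp(X, y)
scores = feature_scores(params, X)
ranking = np.argsort(scores)[::-1]             # most important feature first
selected = sorted(ranking[:2].tolist())        # single-stage selection: keep top 2
print("scores:", np.round(scores, 4))
print("selected features:", selected)
```

A multi-stage variant, in the spirit of the scenarios the abstract mentions, could retrain the model on the selected subset and repeat the scoring; how the paper actually defines its stages is not stated here.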

Keywords: machine learning; feature selection; neural networks; logistic regression; disease prediction models; healthcare data
Date: 2022

Downloads: (external link)
http://www.inderscience.com/link.php?id=125260 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdmmm:v:14:y:2022:i:3:p:248-268

More articles in International Journal of Data Mining, Modelling and Management from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker.

 
Page updated 2025-03-19
Handle: RePEc:ids:ijdmmm:v:14:y:2022:i:3:p:248-268