Machine learning-based risk prediction model for cognitive dysfunction in elderly individuals
Lei Zhang,
Xuan Xiang,
Wei Chen,
Haijun Miao,
Ting Zou,
Ruikai Wu and
Xiaohui Zhou
PLOS ONE, 2025, vol. 20, issue 12, 1-19
Abstract:
Background: With the advancement of globalization, the prevalence of cognitive dysfunction in the elderly population has risen significantly. Early intervention may dramatically alleviate the disease burden and reduce economic costs associated with cognitive impairment. This study aims to construct a risk prediction model for cognitive dysfunction based on machine learning (ML) algorithms, providing healthcare professionals and patients with a more accurate and effective tool for risk assessment. Methods: This study included 1,325 elderly participants who completed cognitive assessments and comprehensive laboratory blood tests. Risk factors for cognitive dysfunction were identified through univariate analysis, multivariate logistic regression, LASSO regression, and the Boruta algorithm. Nine ML methods—Random Forest (RF), Light Gradient Boosting Machine (LightGBM), Extreme Gradient Boosting (XGBoost), Logistic Regression, K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Artificial Neural Network (ANN), Decision Tree, and Elastic Net—were employed to construct the prediction models. The Shapley Additive Explanations (SHAP) algorithm was utilized to interpret the final model. Results: The Random Forest model exhibited the highest predictive performance, with an AUC value exceeding those of other models. SHAP analysis identified age, race, education level, diabetes, and depression as the primary predictors of cognitive dysfunction in the elderly. The calibration curve indicated a strong alignment between the model’s predictions and actual outcomes, while the decision curve confirmed the model’s clinical applicability. Conclusion: Age, race, education level, diabetes, and depression are significant influencing factors of cognitive dysfunction in the elderly. Among the ML algorithms evaluated, the Random Forest model exhibited the best predictive performance.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0336058 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 36058&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0336058
DOI: 10.1371/journal.pone.0336058
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().