Development and validation of a prediction model for long-term cognitive frailty risk in stroke patients based on CHARLS data
Shunli Zuo,
Ning Liu,
Jiaxian Wang,
Jingling Li,
Xiuyuan Zhu and
Yuyang Jia
PLOS ONE, 2026, vol. 21, issue 3, 1-18
Abstract:
Background: This study aimed to develop and validate machine learning (ML) models for predicting the risk of cognitive frailty in community-dwelling elderly adults with stroke. Methods: This study involved 2,325 stroke survivors from the China Health and Retirement Longitudinal Study (CHARLS), conducted between 2018 and 2020. We examined 22 behavioral variables, encompassing indicators from the sociodemographic, physical, psychological, cognitive, and social domains. LASSO regression was employed to identify predictive factors, and eight machine learning models—Logistic Regression, Decision Tree, XGBoost, Support Vector Machine, k-Nearest Neighbors, Naïve Bayes, Random Forest, and LightGBM—were utilized to ascertain the optimal model for predicting cognitive frailty among stroke survivors. SHapley Additive exPlanations (SHAP) values were applied to interpret the contributions of the variables. Results: A total of 2,325 stroke patients were included in the study, among whom 688 (29.59%) exhibited symptoms of cognitive frailty. Of the eight models evaluated, XGBoost (AUC = 0.810) and Random Forest (AUC = 0.795) demonstrated the highest predictive performance for stroke-related cognitive frailty. Key predictors identified were education, nutritional status, physical exercise, Instrumental Activities of Daily Living (IADL), and age, with corresponding SHAP values of 0.28, 0.18, 0.16, 0.21, and 0.32, respectively. The SHAP values indicated that age and education level are the most significant factors in predicting the risk of cognitive frailty in this population. Conclusion: This study developed eight risk prediction models for post-stroke cognitive frailty utilizing machine learning, with the XGBoost algorithm demonstrating superior performance. Leveraging readily available clinical and demographic indicators, the optimized XGBoost model serves as a practical tool for the early screening of cognitive frailty risk among community-dwelling elderly stroke survivors, particularly within primary care settings. This model can aid clinicians in devising targeted intervention strategies to mitigate disease progression and establish a foundation for future prospective studies examining the mechanisms underlying cognitive frailty in stroke populations. Further external validation is necessary to confirm its generalizability across various clinical contexts.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0340715 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 40715&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0340715
DOI: 10.1371/journal.pone.0340715
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().