Cardiovascular risk prediction and influencing predictors identification among Bangladeshi individuals using machine learning algorithms and association rule mining
Md Merajul Islam,
Sujit Kumar,
Md A Salam,
Dulal Chandra Roy and
Md Rezaul Karim
PLOS ONE, 2025, vol. 20, issue 10, 1-18
Abstract:
Background: Cardiovascular disease (CVD) encompasses a group of disorders that affect the heart and blood vessels, making it one of the leading causes of death globally, including in Bangladesh. Applying predictive modeling for the early identification and detection of CVD holds significant promise for saving lives by enhancing prediction precision through machine learning algorithms. Therefore, this study aimed to predict high-risk individuals for CVD using machine learning algorithms and identify its influencing predictors by association mining rules among individuals in Bangladesh. Materials and methods: This study utilized the most recent Bangladesh Demographic and Health Survey (BDHS) 2022 data, which encompassed 2,221 respondents. A Boruta-based feature selection method is employed to determine the important features associated with the high risk of CVD. Different machine learning algorithms, including logistic regression, Naïve Bayes, artificial neural network, random forest, and extreme gradient boosting (XGB), are adopted to predict the high-risk individuals for CVD in the training dataset. The predictive performance of the models is evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC) in the testing set. Additionally, the most significant rules are analyzed using the association mining technique to identify the influencing predictors of high risk of CVD. Results: The Boruta method indicated that age, residence, marital status, wealth, having an air conditioner (AC), and body mass index (BMI) are important predictors of high risk of CVD. The XGB-based predictive model achieves impressive performance compared to other models, with an accuracy of 68.22%, precision of 69.70%, F1-score of 79.54%, and AUC of 0.721. The association rules identified that being aged 65 or older, living in an urban area, having the richest wealth status, having AC, and being widowed are the influencing predictors of high risk of CVD. Conclusions: This study emphasizes the potential of XGB in predicting high-risk individuals for CVD and enhances the investigation of key factors contributing to CVD risk in this population, thereby facilitating the development of targeted prevention strategies that can effectively mitigate the high CVD risk.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0333913 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 33913&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0333913
DOI: 10.1371/journal.pone.0333913
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().