EconPapers    
Economics at your fingertips  
 

Identifying key determinants of health among China’s migrant population using machine learning methods: Evidence from the china migrants dynamic survey

Bo Dong, Yuxin Zhou, Li Wang, Yiyu Wang and Zhenlin Zhang

PLOS ONE, 2025, vol. 20, issue 11, 1-31

Abstract: Background: Continuously improving health security for the migrant population is a key component of China’s healthcare system reform. Existing research indicates that migrant health is influenced by multiple factors, yet the relative importance of these factors remains inadequately measured. This study aims to analyze the current health status of China’s migrant population and rank the primary factors influencing their health based on importance. Methods: Data were sourced from the 2018 China Migrants Dynamic Survey, including 108,669 cases after data cleaning. The health status of the migrant population was initially analyzed using frequency and percentage distributions. Logistic regression was then applied to examine the relationship between various factors and migrant health. Subsequently, six machine learning methods (Neural Network, Random Forest, Support Vector Machine, Gradient Boosting Machine, Extra Trees, and Decision Tree) were applied to rank the importance of these factors. A multidimensional performance metric system (accuracy, precision, recall, F1 score, and AUC value) was employed to comprehensively evaluate the classification performance of the models. SHAP (Shapley Additive Prediction) values were used to illustrate the contribution of different factors to the health status of the migrant population. Results: The health status of China’s migrant population is generally positive, though it is influenced by multiple factors, with varying degrees of significance. Among six distinct machine learning models, the Random Forest model demonstrated the best predictive performance. Its results indicate that the key factors affecting migrant health are age, employment, income, and education level. SHAP value analysis reveals that stable employment, higher education levels, and higher income are positively correlated with better health outcomes, while age was predominantly negatively correlated, indicating a detrimental effect on health status. Conclusion: The overall health status of China’s migrant population is relatively optimistic. However, their disadvantaged positions in areas such as education and income expose them to higher health risks. To address these key determinants, further improvements in health safeguards should focus on: developing stratified intervention strategies based on age structure differences; optimizing work environments and employment security; enhancing health literacy; and strengthening public health emergency management and social support systems.

Date: 2025
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0335168 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 35168&type=printable (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0335168

DOI: 10.1371/journal.pone.0335168

Access Statistics for this article

More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().

 
Page updated 2025-11-29
Handle: RePEc:plo:pone00:0335168