
IMPORTANCE MEASUREMENT OF THE INFLUENCING FACTORS OF LONG-TERM NURSING STATUS IN LONG-TERM NURSING INSURANCE BASED ON MULTIPLE LINEAR REGRESSION, RANDOM FOREST AND XGBOOST MODELS

Yanhan Ji and Xiangdong Liu
Additional contact information
Yanhan Ji: School of Economics, Jinan University, Guangzhou 510623, P. R. China
Xiangdong Liu: School of Economics, Jinan University, Guangzhou 510623, P. R. China

FRACTALS (fractals), 2023, vol. 31, issue 06, 1-14

Abstract: Long-term care for the elderly has become a prominent social problem worldwide as the share of people aged over 65 steadily increases in almost all countries. One possible solution is long-term care insurance provided by insurance companies. However, because long-term care status depends on many factors, insurers need to classify care status types for pricing or to support organizational functions such as departmental communication, business selection, and market segmentation. This research is motivated by the lack of comprehensive studies of the factors that affect the long-term care status of the elderly. To identify those factors, three machine learning (ML) algorithms are employed: multiple linear regression, random forest, and XGBoost. The factors and their important variables are then used to predict insurance pricing. The 2018 China Health and Retirement Longitudinal Study (CHARLS) data set is used to determine the factors that have key impacts on long-term care status in the elderly. Finally, the three models are combined into a comprehensive model to improve prediction accuracy. The results show that the three ML models provide relatively consistent importance measures of the risk factors that determine the nursing status of the elderly. The prediction accuracy of random forest and XGBoost improved by 0.6% and 1%, respectively, compared with multiple linear regression. Moreover, when the ratios 2.6, 3.7, and 3.7 are assigned to the results of the three models, the prediction accuracy of the comprehensive model on the test set is 1.92% higher than that of multiple linear regression. The main innovation of this research is the construction of a comprehensive model, a weighted combination of the three models, with better prediction accuracy. Ultimately, long-term care insurance businesses can use the comprehensive model to classify the long-term care status of the elderly.
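
The weighted combination described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the ratios are applied, so this assumes they are normalized to sum to 1 and used as a weighted average of the three models' predictions, in the order multiple linear regression, random forest, XGBoost. The function name combine_predictions and the prediction values below are hypothetical and are not taken from the paper.

```python
def combine_predictions(predictions, weights):
    """Weighted average of per-model predictions.

    predictions: list of equal-length lists, one list per model.
    weights: one positive weight per model; normalized here to sum to 1.
    """
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n = len(predictions[0])
    if any(len(p) != n for p in predictions):
        raise ValueError("all models must predict the same number of cases")
    shares = [w / total for w in weights]
    # For each case, sum the models' predictions scaled by their shares.
    return [sum(s * p[i] for s, p in zip(shares, predictions)) for i in range(n)]


# Hypothetical predicted care-status scores for three individuals
# (illustrative numbers only, not results from the paper).
mlr_pred = [1.0, 2.0, 3.0]   # multiple linear regression
rf_pred = [2.0, 2.0, 2.0]    # random forest
xgb_pred = [3.0, 2.0, 1.0]   # XGBoost

# Ratios 2.6 : 3.7 : 3.7 as quoted in the abstract; the assignment of each
# ratio to a specific model is an assumption.
combined = combine_predictions([mlr_pred, rf_pred, xgb_pred], [2.6, 3.7, 3.7])
print(combined)
```

Normalizing inside the function means the ratios can be passed exactly as quoted (2.6, 3.7, 3.7) without converting them to fractions first.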

Keywords: Long-Term Care Insurance; Long-Term Care Status; Variable Importance; XGBoost; Random Forest; Multiple Linear Regression
Date: 2023

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0218348X22401776
Access to full text is restricted to subscribers

Persistent link: https://EconPapers.repec.org/RePEc:wsi:fracta:v:31:y:2023:i:06:n:s0218348x22401776

DOI: 10.1142/S0218348X22401776

FRACTALS (fractals) is currently edited by Tara Taylor

More articles in FRACTALS (fractals) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.

 
Page updated 2025-03-20
Handle: RePEc:wsi:fracta:v:31:y:2023:i:06:n:s0218348x22401776