EconPapers    
Economics at your fingertips  
 

An Improved Machine Learning-Based Employees Attrition Prediction Framework with Emphasis on Feature Selection

Saeed Najafi-Zangeneh, Naser Shams-Gharneh, Ali Arjomandi-Nezhad and Sarfaraz Hashemkhani Zolfani
Additional contact information
Saeed Najafi-Zangeneh: Industrial Engineering Department, Amirkabir University of Technology, Tehran 15875-4413, Iran
Naser Shams-Gharneh: Industrial Engineering Department, Amirkabir University of Technology, Tehran 15875-4413, Iran
Ali Arjomandi-Nezhad: Industrial Engineering and Productivity Research Center, Amirkabir University of Technology, Tehran 15875-4413, Iran
Sarfaraz Hashemkhani Zolfani: School of Engineering, Catholic University of the North, Larrondo 1281, 1780000 Coquimbo, Chile

Mathematics, 2021, vol. 9, issue 11, 1-14

Abstract: Companies always seek ways to make their professional employees stay with them to reduce extra recruiting and training costs. Predicting whether a particular employee may leave or not will help the company to make preventive decisions. Unlike physical systems, human resource problems cannot be described by a scientific-analytical formula. Therefore, machine learning approaches are the best tools for this aim. This paper presents a three-stage (pre-processing, processing, post-processing) framework for attrition prediction. An IBM HR dataset is chosen as the case study. Since there are several features in the dataset, the “max-out” feature selection method is proposed for dimension reduction in the pre-processing stage. This method is implemented for the IBM HR dataset. The coefficient of each feature in the logistic regression model shows the importance of the feature in attrition prediction. The results show improvement in the F1-score performance measure due to the “max-out” feature selection method. Finally, the validity of parameters is checked by training the model for multiple bootstrap datasets. Then, the average and standard deviation of parameters are analyzed to check the confidence value of the model’s parameters and their stability. The small standard deviation of parameters indicates that the model is stable and is more likely to generalize well.

Keywords: machine learning; human resource management; feature selection; logistic regression; attrition prediction; bootstrap (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/9/11/1226/pdf (application/pdf)
https://www.mdpi.com/2227-7390/9/11/1226/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:9:y:2021:i:11:p:1226-:d:563759

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:9:y:2021:i:11:p:1226-:d:563759