Machine Learning Models for Predicting Employee Attrition: A Data Science Perspective
Anber Abraheem Shlash Mohammad,
Zeyad Alkhazali,
Suleiman Ibrahim Shelash Mohammad,
Badrea Al Oraini,
Asokan Vasudevan,
Menahi Mosallam Alqahtani and
Muhammad Turki Alshurideh
Data and Metadata, 2025, vol. 4, 669
Abstract:
Introduction: Employee attrition poses significant challenges for organizations, impacting productivity and profitability. This study explores attrition patterns using machine learning models, integrating predictive analytics with established human resource theories to identify key drivers of workforce turnover. Methods: The research analysed a dataset comprising demographic, job-related, and engagement factors. Logistic Regression was employed as the baseline model to interpret linear relationships, while Random Forest and Decision Trees captured non-linear interactions. Performance metrics such as accuracy, precision, recall, F1-score, and AUC-ROC were used to evaluate model effectiveness, alongside feature importance analysis for actionable insights. Results: Results revealed that job satisfaction, tenure, departmental dynamics, and engagement levels are critical predictors of attrition. Random Forest emerged as the most effective model, achieving an accuracy of 92% and an AUC-ROC of 94%, highlighting its capability to capture complex patterns. Decision Trees provided interpretable decision rules, offering practical thresholds for HR interventions. Logistic Regression complemented these models by offering insights into direct, linear relationships between predictors and attrition. Conclusion: The study finds that machine learning improves attrition analysis by identifying complex patterns and enabling proactive retention strategies. Predictive analytics strengthens traditional theories, providing a structured approach to reducing employee turnover. Organizations can use these insights to enhance workforce stability and performance. Future research could incorporate qualitative methods and longitudinal studies to refine strategies and assess long-term impacts.
Date: 2025
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:dbk:datame:v:4:y:2025:i::p:669:id:1056294dm2025669
DOI: 10.56294/dm2025669
Access Statistics for this article
More articles in Data and Metadata from AG Editor
Bibliographic data for series maintained by Javier Gonzalez-Argote ().