A Hybrid Predictive Model for Employee Turnover: Integrating Ensemble Learning and Feature-Driven Insights from IBM HR Analytics
Abstract
1. Introduction
- Can a hybrid ensemble framework combining linear, tree-based, and boosting algorithms enhance attrition prediction performance under severe class imbalance?
- Do engineered interaction features—specifically those capturing hierarchical–experiential alignment—improve both predictive accuracy and interpretability?
- Can explainable AI methods such as SHAP meaningfully translate machine learning outputs into actionable insights for HR practitioners?
2. Literature Review
2.1. Evolution of Machine Learning in Human Resource Analytics
2.2. Predictive Modelling for Employee Attrition
2.3. Hybrid Models and Feature-Driven Insights
2.4. Ethical Considerations and Future Directions
3. Methodology
3.1. Data Preprocessing and Feature Engineering
- YearsAtCompany_Ratio: The ratio of YearsAtCompany to TotalWorkingYears, indicating the proportion of an employee’s total career spent with the current company.
- Income_Per_Year: Calculated as MonthlyIncome divided by TotalWorkingYears, representing the monthly income earned per year of overall experience.
- JobLevel_Experience_Interaction: An interaction term created by multiplying JobLevel and TotalWorkingYears to capture the combined effect of seniority and total experience.
3.2. Model Selection and Hybrid Ensemble Design
3.3. Mathematical Models
3.4. Hyperparameter Model
3.5. Evaluation Metrics
3.6. Model Interpretability
4. Analysis and Results
4.1. Model Performance Comparison
4.1.1. Accuracy
4.1.2. Precision and Recall
4.2. XGBoost Model Specifics
Confusion Matrix
- True Negatives (205): The model correctly predicted that 205 employees would not leave.
- True Positives (30): The model correctly identified 30 employees who did quit.
- False Positives (42): The model incorrectly predicted that 42 employees would quit when they did not.
- False Negatives (17): The model failed to identify 17 employees who actually left the company.
4.3. Receiver Operating Characteristic (ROC) Curve
- AUC Score: Tuned XGBoost had a score of AUC = 0.80. AUC = 0.5 is a chance level, and AUC = 1.0 is an ideal classifier. The Area Under the Curve (AUC) is 0.80, which suggests that the model has an 80% chance of correctly distinguishing between an employee who will quit and one who will not. The value of 0.80 shows that the model is strongly capable of distinguishing between the employees who do and do not quit to a medium degree. While this is a strong result, future improvements could aim for higher sensitivity in minority class detection.
4.4. Feature Importance Analysis
- JobLevel Experience Interaction
- Total Working Years
- Over Time
- Monthly Income
5. Discussion
6. Conclusions
7. Future Work and Limitations
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Dash, S.; Mishra, S.; Tripathy, S.K. Contextualization of Employee Recruitment and Retention in Technology Start-Ups. In Palgrave Studies in Democracy, Innovation and Entrepreneurship for Growth; Springer: Berlin/Heidelberg, Germany, 2022; pp. 187–213. [Google Scholar] [CrossRef]
- Raza, A.; Munir, K.; Almutairi, M.; Younas, F.; Fareed, M.M.S. Predicting Employee Attrition Using Machine Learning Approaches. Appl. Sci. 2022, 12, 6424. [Google Scholar] [CrossRef]
- Hamja, A.; Hasan, M.; Hassan, Z.; Uddin, M.P.; Siddikee, M.J.A. An Explainable Machine Learning-Based Employee Attrition Predictive System. Ann. Data Sci. 2025. [Google Scholar] [CrossRef]
- Rajagopal, N.K.; Anand, M.; Mohanty, S. Exploring Machine Learning Applications in Human Resources Management: A Comprehensive Review. Stud. Syst. Decis. Control 2024, 569, 303–313. [Google Scholar] [CrossRef]
- Basnet, S. Artificial Intelligence and Machine Learning in Human Resource Management: Prospect and Future Trends. Int. J. Res. Publ. Rev. 2024, 5, 281–287. [Google Scholar] [CrossRef]
- Pandey, S.; Pandey, S. Leveraging Artificial Intelligence and Machine Learning for Workforce Optimization: A Holistic Approach to HR Transformation. J. Artif. Intell. Mach. Learn. Data Sci. 2025, 3, 2680–2683. [Google Scholar] [CrossRef]
- Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A Perspective on Explainable Artificial Intelligence Methods: SHAP and LIME. Adv. Intell. Syst. 2025, 7, 2400304. [Google Scholar] [CrossRef]
- Aldughayfiq, B.; Ashfaq, F.; Jhanjhi, N.Z.; Humayun, M. Explainable AI for Retinoblastoma Diagnosis: Interpreting Deep Learning Models with LIME and SHAP. Diagnostics 2023, 13, 1932. [Google Scholar] [CrossRef]
- Arunika, M.; Saranya, S.; Charulekha, S.; Kabilarajan, S.; Kesavan, G. A Survey on Explainable AI Using Machine Learning Algorithms Shap and Lime. In Proceedings of the 2024 15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024, Kamand, India, 18–22 June 2024. [Google Scholar] [CrossRef]
- Moscato, V.; Khan, H.W.; Sattar, M.U.; Noor, S.; Alyousef, M.I. A Personality-Informed Candidate Recommendation Framework for Recruitment Using MBTI Typology. Information 2025, 16, 863. [Google Scholar] [CrossRef]
- Wandhe, P. The Transformative Role of Artificial Intelligence in HR: Revolutionizing the Future of HR. SSRN Electron. J. 2023. [Google Scholar] [CrossRef]
- Gagandeep; Verma, J.; Gupta, D. Investigating the Transformative Effects of AI, Machine Learning, and Robotics on Human Capital Analytics—An Empirical Study. Hum. Cap. Anal. 2025, 111–141. [Google Scholar] [CrossRef]
- López, O.A.M.; López, A.M.; Crossa, J. Fundamentals of Artificial Neural Networks and Deep Learning. In Multivariate Statistical Machine Learning Methods for Genomic Prediction; Springer: Berlin/Heidelberg, Germany, 2022; pp. 379–425. [Google Scholar] [CrossRef]
- Subramanian, Y.R.; R, R. The Transformative Role of Artificial Intelligence in Human Resource. Int. J. Recent Trends Bus. Tour. 2024, 8, 14–25. [Google Scholar] [CrossRef]
- Arora, M.; Prakash, A.; Mittal, A.; Singh, S. HR Analytics and Artificial Intelligence-Transforming Human Resource Management. In Proceedings of the 2021 International Conference on Decision Aid Sciences and Application, DASA 2021, Virtual, 7–8 December 2021; pp. 288–293. [Google Scholar] [CrossRef]
- Căvescu, A.M.; Popescu, N. Predictive Analytics in Human Resources Management: Evaluating AIHR’s Role in Talent Retention. AppliedMath 2025, 5, 99. [Google Scholar] [CrossRef]
- Shafie, M.R.; Khosravi, H.; Farhadpour, S.; Das, S.; Ahmed, I. A cluster-based human resources analytics for predicting employee turnover using optimized Artificial Neural Networks and data augmentation. Decis. Anal. J. 2024, 11, 100461. [Google Scholar] [CrossRef]
- Bonilla-Chaves, E.F.; Palos-Sánchez, P.R. Exploring the Evolution of Human Resource Analytics: A Bibliometric Study. Behav. Sci. 2023, 13, 244. [Google Scholar] [CrossRef]
- Krishna Adabala, S. Citation: Adabala SK. AI in HR Evolution: Harnessing Machine Learning for Modern Solutions. J. Artif. Intell. Mach. Learn. Data Sci. 2023, 2023, 1702–1707. [Google Scholar] [CrossRef]
- Ahmad, B.; Sattar, M.U.; Khan, H.W.; Qureshi, Z.; Hasan, R.; Ahmad, A.; Azad, M. Intelligent Digital Twin to make Robot Learn the Assembly process through Deep Learning. Lahore Garrison Univ. Res. J. Comput. Sci. Inf. Technol. 2021, 5, 65–72. [Google Scholar] [CrossRef]
- Kambhampati, V.K.; Rao, K.B.V.B. Advancing Employee Attrition Models: A Systematic Review of Machine Learning Techniques and Emerging Research Opportunities. In Proceedings of the 8th International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud), I-SMAC 2024—Proceedings, Kirtipur, Nepal, 3–5 October 2024; pp. 1100–1107. [Google Scholar] [CrossRef]
- Chung, D.; Yun, J.; Lee, J.; Jeon, Y. Predictive model of employee attrition based on stacking ensemble learning. Expert Syst. Appl. 2023, 215, 119364. [Google Scholar] [CrossRef]
- Quinteros, D.M. Acadlore Transactions on AI and Machine Learning Predictive Modelling of Employee Attrition Using Deep Learning. Acadlore Trans. Ai Mach. Learn 2023, 2, 212–225. [Google Scholar] [CrossRef]
- Varkiani, S.M.; Pattarin, F.; Fabbri, T.; Fantoni, G. Predicting employee attrition and explaining its determinants. Expert Syst. Appl. 2025, 272, 126575. [Google Scholar] [CrossRef]
- Shafeeq, S.; Ali, M.; Azam, M.; Hashmi, M.U.; Ullah, M.A.; Ittfaq, A. hybrid machine learning framework for predicting employee attrition. Cent. Manag. Sci. Res. 2025, 3, 578–589. [Google Scholar] [CrossRef]
- Chaudhary, M.; Gaur, L.; Chakrabarti, A.; Singh, G.; Jones, P.; Kraus, S. An integrated model to evaluate the transparency in predicting employee churn using explainable artificial intelligence. J. Innov. Knowl. 2025, 10, 100700. [Google Scholar] [CrossRef]
- Al-Shammari, M.; Ghanem, Y.A. A Systematic Literature Review of Quantitative Models for Predicting Employee Attrition. In Proceedings of the 2024 International Conference on Decision Aid Sciences and Applications (DASA), Manama, Bahrain, 11–12 December 2024. [Google Scholar] [CrossRef]
- Lee, K.; Lee, J.; Park, E. TFBlender: A hybrid time series attention model with data-driven macroeconomic perspectives for ELS Knock-In prediction. J. Big Data 2025, 12, 173. [Google Scholar] [CrossRef]
- Oyeniran, M.; Adekunle, J.; Sule, H.; Folorunso, O.; Alagbe, S.; Anifowoshe, T.; Robbert, C.; Ebonyem, B.; Ideh, G.; Oyelakin, S.; et al. Personalized Energy Optimization in Smart Homes Using Adaptive Machine Learning Models: A Feature-Driven Approach. Int. J. Artif. Intell. Sci. 2025, 2, 83–104. [Google Scholar] [CrossRef]
- Yao, X.; Xu, Z.; Ren, T.; Zeng, X.J. Feature-driven hybrid attention learning for accurate water quality prediction. Expert Syst. Appl. 2025, 276, 127160. [Google Scholar] [CrossRef]
- Talebi, H.; Bardsiri, A.K.; Bardsiri, V.K. Machine Learning Approaches for Predicting Employee Turnover: A Systematic Review. Eng. Rep. 2025, 7, e70298. [Google Scholar] [CrossRef]
- Talebi, H.; Bardsiri, A.K.; Bardsiri, V.K. Developing a hybrid machine learning model for employee turnover prediction: Integrating LightGBM and genetic algorithms. J. Open Innov. Technol. Mark. Complex. 2025, 11, 100557. [Google Scholar] [CrossRef]
- Behera, S.K.; Dash, R. A Novel Framework for Mental Illness Detection Leveraging TOPSIS-ModCHI-Based Feature-Driven Randomized Neural Networks. Math. Comput. Appl. 2025, 30, 67. [Google Scholar] [CrossRef]
- Ufeli, C.P.; Sattar, M.U.; Hasan, R.; Mahmood, S. Enhancing Customer Segmentation Through Factor Analysis of Mixed Data (FAMD)-Based Approach Using K-Means and Hierarchical Clustering Algorithms. Information 2025, 16, 441. [Google Scholar] [CrossRef]
- Haque, M.; Paralkar, T.A.; Rajguru, S.; Goyal, A.A.; Patil, T.; Upreti, K. Featuring Machine Learning Models to Evaluate Employee Attrition: A Comparative Analysis of Workforce Stability-Relating Factors. Int. Res. J. Multidiscip. Scope 2025, 6, 862–873. [Google Scholar] [CrossRef]
- Babar, A.R.; Samad, A. AI Adoption Imperatives for Employees Engagement: A Leadership Approach; IGI Global Scientific Publishing: Hershey, PA, USA, 2025; pp. 1–26. [Google Scholar] [CrossRef]
- Myszak, J.M.; Filina-Dawidowicz, L. Leaders’ Competencies and Skills in the Era of Artificial Intelligence: A Scoping Review. Appl. Sci. 2025, 15, 10271. [Google Scholar] [CrossRef]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Huang, C.; Zhu, X.; Lu, M.; Zhang, Y.; Yang, S. XGBoost algorithm optimized by simulated annealing genetic algrithm for permeability prediction modeling of carbonate reservoirs. Sci. Rep. 2025, 15, 14882. [Google Scholar] [CrossRef]
- Bewick, V.; Cheek, L.; Ball, J. Statistics review 14: Logistic regression. Crit. Care 2005, 9, 112. [Google Scholar] [CrossRef]
- Mobley, W.H. Intermediate linkages in the relationship between job satisfaction and employee turnover. J. Appl. Psychol. 1977, 62, 237–240. [Google Scholar] [CrossRef]
- Tura, A.S. Determinants of Employee’s Turnover: A Case Study at Madda Walabu University Abebe Seboka Tura. Adv. Manag. Appl. Econ. 2020, 10, 1792–7552. [Google Scholar]
- Lazăr, F.; Rentea, G.C.; Mihai, A.; Niță, D.; Munch, S. Retention and turnover in social work practice: What role do trusting colleagues, overtime, and workload play in job satisfaction? J. Soc. Work Pract. 2025, 39, 21–36. [Google Scholar] [CrossRef]
- Saketh, S.; Saicharan, V.; Rohith, V.; Rao, D. Leveraging Knowledge Graphs and Explainable AI to Improve Employee Turnover Predictions. Int. J. Adv. Res. Educ. Technol. 2025, 12, 1213–1220. [Google Scholar]








| Model | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|
| Logistic Regression | 0.7993 | 0.4167 | 0.6383 | 0.5042 |
| Random Forest | 0.8367 | 0.4865 | 0.3830 | 0.4286 |
| XGBoost (SMOTE Hybrid) | 0.8435 | 0.5152 | 0.3617 | 0.4250 |
| XGBoost (ADASYN Hybrid) | 0.8231 | 0.4419 | 0.4043 | 0.4222 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Alyousef, M.I.; Khan, H.W.; Sattar, M.U. A Hybrid Predictive Model for Employee Turnover: Integrating Ensemble Learning and Feature-Driven Insights from IBM HR Analytics. Information 2026, 17, 208. https://doi.org/10.3390/info17020208
Alyousef MI, Khan HW, Sattar MU. A Hybrid Predictive Model for Employee Turnover: Integrating Ensemble Learning and Feature-Driven Insights from IBM HR Analytics. Information. 2026; 17(2):208. https://doi.org/10.3390/info17020208
Chicago/Turabian StyleAlyousef, Muna I., Hamza Wazir Khan, and Mian Usman Sattar. 2026. "A Hybrid Predictive Model for Employee Turnover: Integrating Ensemble Learning and Feature-Driven Insights from IBM HR Analytics" Information 17, no. 2: 208. https://doi.org/10.3390/info17020208
APA StyleAlyousef, M. I., Khan, H. W., & Sattar, M. U. (2026). A Hybrid Predictive Model for Employee Turnover: Integrating Ensemble Learning and Feature-Driven Insights from IBM HR Analytics. Information, 17(2), 208. https://doi.org/10.3390/info17020208

