全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于机器学习的员工流失预测
Employee Attrition Prediction Based on Ma-chine Learning

DOI: 10.12677/MOS.2023.126502, PP. 5531-5542

Keywords: 员工流失预测,机器学习,RF-RFE,LightGBM,WRF
Employee Attrition Prediction
, Machine Learning, RF-RFE, LightGBM, WRF

Full-Text   Cite this paper   Add to My Lib

Abstract:

员工流失是当今组织中的重要问题,通过机器学习等技术对员工离职进行事前预测,有助于提升企业人力资源管理的前瞻性。本文首先从采集的数据集中提取有用的且适合模型训练条件的数据,进行数据清洗和探索性分析,了解各特征分布情况;使用One-Hot编码和标签编码相结合的方式进行编码,然后采用RF-RFE方法对数据集中的特征进行筛选后进入分类模型;为保证模型预测的准确性,采用了五种不同的机器学习算法,包括SVM、DT、RF、LightGBM和WRF,来建立模型对员工流失情况进行预测。综合结果显示,LightGBM算法在预测性能方面表现出色,其准确率达到了0.87;进而通过SHAP输出特征重要性排名,发现城市发展指数、工作经验和培训时数等是影响离职的重要因素,可以为企业员工保留和后续人才招聘决策提供技术支持。
Employee turnover is an important issue in today’s organizations. Predicting employee turnover in advance through machine learning and other technologies can help improve the foresight of enter-prise human resource management. In this paper, the useful data are extracted from the collected data set, and the data cleaning and exploratory analysis are carried out to understand the distribu-tion of features. A combination of One-Hot coding and label coding was used, and then the features in the dataset were screened by RF-RFE method and entered into the classification model. In order to ensure the accuracy of the model prediction, five different machine learning algorithms, includ-ing SVM, DT, RF, LightGBM and WRF, were used to build the model to predict the employee turnover situation. The results show that LightGBM algorithm has excellent prediction performance, and its accuracy rate reaches 0.87. By outputting SHAP feature importance ranking, it is found that urban development index, work experience, training hours, etc., are important factors affecting turnover, which can provide technical support for employee retention and follow-up talent recruitment deci-sions.

References

[1]  Ann, D. and McMahon, F. (1992) Labour Turnover in London Hotels and the Cost Effectiveness of Preventative Measures. In-ternational Journal of Hospitality Management, 11, 143-154.
https://doi.org/10.1016/0278-4319(92)90007-I
[2]  Ozmen, E.P. and Ozcan, T. (2022) A Novel Deep Learning Model Based on Convolutional Neural Networks for Employee Churn Prediction. Journal of Forecasting, 41, 539-550.
https://doi.org/10.1002/for.2827
[3]  刘春燕. 基于XGBoost的员工流失预测研究——以IBM公司为例[D]: [硕士学位论文]. 大连: 大连理工大学, 2021.
[4]  王冠鹏, 秦双燕, 崔恒建. 员工流失的影响因素分析与预测[J]. 系统科学与数学, 2022, 42(6): 1616-1632.
[5]  Cui, H.J., Li, R.Z. and Zhong, W. (2015) Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis. Journal of the American Statistical Association, 110, 630-641.
https://doi.org/10.1080/01621459.2014.920256
[6]  万毅斌, 王绍宇, 秦彦霞. 基于代价敏感加权支持向量机的员工离职分类预测[J]. 智能计算机与应用, 2021, 11(12): 43-46+53.
[7]  Mobley, W.H. (1977) Intermediate Link Ages in the Relationship between Job Satisfaction and Employee Turnover. Journal of Applied Psychology, 62, 237-240.
https://doi.org/10.1037/0021-9010.62.2.237
[8]  李芸, 胡可, 董欣雨, 袁淑俊. 基于SVM算法的企业员工离职预警研究[J]. 中国商论, 2020(6): 20-22.
[9]  Abellán, J., Mantas, C.J., Castellano, J.G. and Moral-García, S. (2018) Increasing Diversity in Random Forest Learning Algorithm via Imprecise Probabilities. Expert Systems with Applications, 97, 228-243.
https://doi.org/10.1016/j.eswa.2017.12.029
[10]  Gomes, H.M., Bifet, A., Read, J., Barddal, J.P., Enembreck, F., Pfharin-ger, B., Holmes, G. and Abdessalem, T. (2017) Adaptive Random Forests for Evolving Data Stream Classification. Machine Learning, 106, 1469-1495.
https://doi.org/10.1007/s10994-017-5642-8

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133