In customer relationship management, it is important for e-commerce
businesses to attract new customers and
retain existing ones. Research on customer churn prediction using AI technology
is now a major part of e-commerce management. This paper proposes a
churn prediction model based on the combination of k-means clustering and
AdaBoost classifier algorithm, allowing the segmentation of customers into
three categories. Important customer groups can also be determined based on
customer behavior and temporal data. Customer churn prediction was carried out
using AdaBoost classification and BP neural network techniques. The results
show that the research method of clustering before prediction can improve
prediction accuracy. In addition, a comparative analysis of the results suggests that the AdaBoost model has better prediction accuracy than the BP neural
network model. The research results of this paper can help B2C e-commerce
companies develop customer retention measures and marketing strategies.
References
[1]
Abbasimehr, H., Setak, M., & Tarokh, M. J. (2014). A Comparative Assessment of the Performance of Ensemble Learning in Customer Churn Prediction. The International Arab Journal of Information Technology, 11, 599-606.
[2]
Ahmed, A. A. Q., & Maheswari, D. (2019). An Enhanced Ensemble Classifier for Telecom Churn Prediction Using Cost Based Uplift Modeling. International Journal of Information Technology, 11, 381-391. https://doi.org/10.1007/s41870-018-0248-3
[3]
Alboukaey, N., Joukhadar, A., & Ghneim, N. (2020). Dynamic Behavior Based Churn Prediction in Mobile Telecom. Expert Systems with Applications, 162, Article ID: 113779.
[4]
Aliyun TIANCH (2021). Alibaba Cloud Tianchi Data Sets. https://tianchi.aliyun.com/dataset/dataDetail?dataId=649
[5]
Amin, A., Anwar, S., Adnan, A., Adnan, A., Nawaz, M., Alawfi, K., Hussain, A. et al. (2017). Customer Churn Prediction in the Telecommunication Sector Using a Rough Set Approach. Neurocomputing, 237, 242-254.
[6]
Bi, Q. Q. (2019). Cultivating Loyal Customers through Online Customer Communities: A Psychological Contract Perspective. Journal of Business Research, 103, 34-44.
[7]
Bock, K. W. D., & De, C. A. (2021). Spline-Rule Ensemble Classifiers with Structured Sparsity Regularization for Interpretable Customer Churn Modeling. Decision Support Systems, 150, 1-14.
[8]
Breiman, L. (1996). Bagging Predictors. Machine Learning, 24, 123-140. https://doi.org/10.1007/BF00058655
[9]
Breiman, L. (2001). Random Forests. Machine Learning, 45, 5-32. https://doi.org/10.1023/A:1010933404324
[10]
Cao, L. (2008). Behavior Informatics and Analytics: Let Behavior Talk. In F. Bonchi, B. Berendt, F. Giannotti, D. Gunopulos, F. Turini, C. Zaniolo, N. Ramakrishnan, & X.D. Wu (Eds.), 2008 IEEE International Conference on Data Mining Workshops (pp. 87-96). IEEE Computer Society Press. https://doi.org/10.1109/ICDMW.2008.95
[11]
Chen, M. C., Chiu, A. L., & Chang, H. H. (2005). Mining Changes in Customer Behavior in Retail Marketing. Expert Systems with Applications, 28, 773-781.
[12]
Chen, N., Chen, A., & Zhou, L. (2001). An Effective Clustering Algorithm in Large Transaction Databases. Journal of Software, 12, 476-484.
[13]
Chung, B. D., Park, J. H., Koh, Y. J., & Lee, S. (2016). User Satisfaction and Retention of Mobile Telecommunications Services in Korea. International Journal of Human-Computer Interaction, 32, 532-543. https://doi.org/10.1080/10447318.2016.1179083
[14]
Coussement, K. (2014). Improving Customer Retention Management through Cost-Sensitive Learning. European Journal of Marketing, 48, 477-495. https://doi.org/10.1108/EJM-03-2012-0180
[15]
Coussement, K., Lessmann, S., & Verstraeten, G. (2017). A Comparative Analysis of Data Preparation Algorithms for Customer Churn Prediction: A Case Study in the Telecommunication Industry. Decision Support Systems, 95, 27-36.
[16]
Daoud, R. A., Amine, A. E., Bouikhalene, B., & Lbibb, R. (2018). Clustering Prediction Techniques in Defining and Predicting Customers Defection: The Case of E-Commerce Context. International Journal of Electrical and Computer Engineering, 8, 2367-2383. https://doi.org/10.11591/ijece.v8i4.pp2367-2383
[17]
De, C. A., Coussement, K. Verbeke, W. Idbenjra, K., & Phan, M. (2021). Uplift Modeling and Its Implications for B2B Customer Churn Prediction: A Segmentation-Based Modeling Approach. Industrial Marketing Management, 99, 28-39.
[18]
De, C. A., Coussement, K., & De Bock, K. W. (2018). A New Hybrid Classification Algorithm for Customer Churn Prediction Based on Logistic Regression and Decision Trees. European Journal of Operational Research, 269, 760-772.
[19]
Devriendt, F., Berrevoets, J., & Verbeke, W. (2021). Why You Should Stop Predicting Customer Churn and Start Using Uplift Models. Information Sciences, 548, 497-515.
[20]
Dumitrescu, E., Hue, S., Hurlin, C., & Tokpavi, S. (2022). Machine Learning for Credit Scoring: Improving Logistic Regression with Non-Linear Decision-Tree Effects. European Journal of Operational Research, 297, 1178-1192.
[21]
Eichinger, F., Nauck, D. D., & Klawonn, F. (2006). Sequence Mining for Customer Behaviour Predictions in Telecommunications. In S. Bickel (Ed.), The Workshop on Practical Data Mining: Applications, Experiences and Challenges (pp. 3-10). Springer.
[22]
Fader, P. S., Hardie, B. G. S., & Lee, K. L. (2005). “Counting Your Customers” the Easy Way: An Alternative to the Pareto/NBD Model. Marketing Science, 24, 275-284. https://doi.org/10.1287/mksc.1040.0098
[23]
Fan, X., & Ke, T. (2010). Enhanced Maximum AUC Linear Classifier. In M. Z. Li, Q. L. Liang, L. P. Wang, & Y. B. Song (Eds.), International Conference on Fuzzy Systems and Knowledge Discovery (pp. 1540-1544). Institute of Electrical and Electronics Engineers.
[24]
Farquad, M. A. H., Ravi, V., & Raju, S. B. (2014). Churn Prediction Using Comprehensible Support Vector Machine: An Analytical CRM Application. Applied Soft Computing, 19, 31-40.
[25]
Freund, Y., & Schapire, E. R. (1996). Experiments with a New Boosting Algorithm. In L. Saitta (Ed.), Machine Learning: The Thirteenth International Conference (pp. 1-9). Morgan Kaufmann.
[26]
Gattermann-Itschert, T., & Thonemann, U. W. (2021). How Training on Multiple Time Slices Improves Performance in Churn Prediction. European Journal of Operational Research, 295, 664-674.
[27]
Goldstein, B. A., Polley, E. C., & Briggs, F. B. S. (2011). Random Forests for Genetic Association Studies. Statistical Applications in Genetics and Molecular Biology, 10, 32. https://doi.org/10.2202/1544-6115.1691
[28]
Gordini, N., & Veglio, V. (2014). Using Neural Networks for Customer Churn Prediction Modeling: Preliminary Findings from the Italian Electricity Industry. In Convegno Annuale della Società Italiana Marketing: “Smart Life. Dall’Innovazione Tecnologica al Mercato (pp. 1-13). Università degli Studi di Milano-Bicocca.
[29]
Gordini, N., & Veglio, V. (2017). Customers Churn Prediction and Marketing Retention Strategies, an Application of Support Vector Machines Based on the AUC Parameter-Selection Technique in B2B E-Commerce Industry. Industrial Marketing Management, 62, 100-107.
[30]
Hadden, J., Tiwari, A., Roy, R., & Ruta, D. (2007). Computer Assisted Customer Churn Management: State-of-the-Art and Future Trends. Computers & Operations Research, 34, 2902-2917.
[31]
Jahromi, A. T., Stakhovych, S., & Ewing, M. (2014). Managing B2B Customer Churn, Retention and Profitability. Industrial Marketing Management, 43, 1258-1268.
[32]
Ji, H., Ni, F., & Liu, J. (2021). Prediction of Telecom Customer Churn Based on XGB-BFS Feature Selection Algorithm. Computer Technology and Development, 31, 21-25.
[33]
Kotler, P., & Keller, K. (2008). Marketing Management: International Edition. Prentice Hall.
[34]
Kotler, P., & Keller, K. (2016). Marketing Management (15nd ed., pp. 89-120.). Pearson Education Ltd.
[35]
Kozak, J., Kania, K., Juszczuk, P., & Mitręga, M. (2021). Swarm Intelligence Goal-Oriented Approach to Data-Driven Innovation in Customer Churn Management. International Journal of Information Management, 60, 16 p.
[36]
Minka, T. P. (2004). A Comparison of Numerical Optimizers for Logistic Regression. Technical Report (Mathematics), 18 p.
[37]
Neslin, S. A., Gupta, S., Kamakura, W., Lu, J. X., & Mason, C. H. (2006). Defection Detection: Measuring and Understanding the Predictive Accuracy of Customer Churn Models. Journal of Marketing Research, 43, 204-211. https://doi.org/10.1509/jmkr.43.2.204
[38]
Nie, G. L., Rowe, W., Zhang, L. L., Tian, Y. J., & Shi, Y. (2011). Credit Card Churn Forecasting by Logistic Regression and Decision Tree. Expert Systems with Applications, 38, 15273-15285.
[39]
O’Brien, M., Liu, Y., Chen, H. Y., & Lusch, R. (2020). Gaining Insight to B2B Relationships through New Segmentation Approaches: Not All Relationships Are Equal. Expert Systems with Applications, 161, 1-11.
[40]
Orsenigo, C., & Vercellis, C. (2010). Combining Discrete SVM and Fixed Cardinality Warping Distances for Multivariate Time Series Classification. Pattern Recognition, 43, 3787-3794.
[41]
Pham, D. T., Dimov, S. S., & Nguyen, C. D. (2004). Selection of K in K-Means Clustering. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 219, 103-119. https://doi.org/10.1243/095440605X8298
[42]
Schaeffer, S. E., & Sanchez, S. V. R. (2020). Forecasting Client Retention-A Machine-Learning Approach. Journal of Retailing and Consumer Services, 52, Article ID: 101918. https://doi.org/10.1016/j.jretconser.2019.101918
[43]
Schmittlein, D. C., Morrison, D. G., & Colombo, R. (1987). Counting Your Customers: Who Are They and What Will They Do Next? Management Science, 33, 1-24. https://doi.org/10.1287/mnsc.33.1.1
[44]
Tian, L., Qiu, H., & Zheng, L. (2007). Telecom Chum Prediction Modeling and Application Based on Neural Network. Computer Applications, 27, 2294-2297.
[45]
Vafeiadis, T., Diamantaras, K. I., Sarigiannidis, G., & Chatzisavvas, K. C. (2015). A Comparison of Machine Learning Techniques for Customer Churn Prediction. Simulation Modelling Practice and Theory, 55, 1-9. https://doi.org/10.1016/j.simpat.2015.03.003
[46]
Velez, D., Ayuso, A., Perales-González, C., & Rodriguez, J. T. (2020). Churn and Net Promoter Score Forecasting for Business Decision-Making through a New Stepwise Regression Methodology. Knowledge-Based Systems, 196, Article ID: 105762. https://doi.org/10.1016/j.knosys.2020.105762
[47]
Verbeke, W., Dejaeger, K., Martens, D., Hur, J., & Baesens, B. (2012). New Insights into Churn Prediction in the Telecommunication Sector: A Profit Driven Data Mining Approach. European Journal of Operational Research, 218, 211-229. https://doi.org/10.1016/j.ejor.2011.09.031
[48]
Viaene, S., & Dedene, G. (2005). Cost-Sensitive Learning and Decision Making Revisited. European Journal of Operational Research, 166, 212-220. https://doi.org/10.1016/j.ejor.2004.03.031
[49]
Wu, X., & Meng, S. (2016). E-Commerce Customer Churn Prediction Based on Customer Segmentation and AdaBoost. In 13th International Conference on Service Systems and Service Management (pp.). Institute of Electrical and Electronics Engineers.
[50]
Xie, Y. Y., & Li, X. (2008). Churn Prediction with Linear Discriminant Boosting Algorithm. In International Conference on Machine Learning and Cybernetics (pp. 228-233). Institute of Electrical and Electronics Engineers.
[51]
Ying, W. Y., Lin, N., Xie, Y. Y., & Li, X. (2010). Research on the LDA Boosting in Customer Churn Prediction. Journal of Applied Statistics & Management, 29, 400-408.
[52]
Zhang, W., Yang, S., & Liu, T. (2014). Customer Churn Prediction in Mobile Communication Enterprises Based on CART and Boosting Algorithm. Chinese Journal of Management Science, 22, 90-96.
[53]
Zhang, Y., & Zhang, Z. M. (2015). A Customer Churn Alarm Model Based on the C5.0 Decision Tree-Taking the Postal Short Message as an Example. Statistics & Information Forum, 30, 89-94.
[54]
Zhao, M., Zeng, Q. J., Chang, M., Tong, Q., & Su, J. F. (2021). A Prediction Model of Customer Churn Considering Customer Value: An Empirical Research of Telecom Industry in China. Discrete Dynamics in Nature and Society, 2021, Article ID: 7160527.