Corporate distress signals are important for both institutions and banks when evaluating firms’ performances. This paper evaluates five different models in predicting the distress for listed companies in China based on 22 dimensions of financial data from 2014 to 2022. The models include three ensemble machine learning models: Adaboost, Bagging, and Random Forest, as well as a single machine learning model Decision Tree, along with a benchmark Logistic Regression. The comparative analysis found Random Forest to be the most promising method with the highest accuracy ratio and lowest Type I and Type II errors. This paper concludes that ensemble learning models could be an easy-to-replicate and highly efficient tool for institutions and banks to evaluate and predict potential distress in firms.
References
[1]
Smiti, S. and Soui, M. (2020) Bankruptcy Prediction Using Deep Learning Approach Based on Borderline Smote. Information Systems Frontiers, 22, 1067-1083. https://doi.org/10.1007/s10796-020-10031-6
[2]
Varian, H.R. (2014) Big Data: New Tricks for Econometrics. Journal of Economic Perspectives, 28, 3-28. https://doi.org/10.1257/jep.28.2.3
[3]
Wang, G. and Ma, J. (2011) Study of Corporate Credit Risk Prediction Based on Integrating Boosting and Random Subspace. Expert Systems with Applications, 38, 13871-13878. https://doi.org/10.1016/j.eswa.2011.04.191
[4]
Chen, J.M. (2021) An Introduction to Machine Learning for Panel Data. International Advances in Economic Research, 27, 1-16. https://doi.org/10.1007/s11294-021-09815-6
[5]
Andini, M., Ciani, E., de Blasio, G., D’Ignazio, A. and Salvestrini, V. (2018) Targeting with Machine Learning: An Application to a Tax Rebate Program in Italy. Journal of Economic Behavior & Organization, 156, 86-102. https://doi.org/10.1016/j.jebo.2018.09.010
[6]
Athey, S. (2018) The Impact of Machine Learning on Economics. In: Agrawal, A., Gans, J. and Goldfarb, A., Eds., The Economics of Artificial Intelligence: An Agenda, University of Chicago Press, Chicago, 507-547. https://doi.org/10.7208/chicago/9780226613475.003.0021
[7]
Kleinberg, J., Ludwig, J., Mullainathan, S. and Obermeyer, Z. (2015) Prediction Policy Problems. American Economic Review, 105, 491-495. https://doi.org/10.1257/aer.p20151023
[8]
Mullainathan, S. and Spiess, J. (2017) Machine Learning: An Applied Econometric Approach. Journal of Economic Perspectives, 31, 87-106. https://doi.org/10.1257/jep.31.2.87
[9]
Mayr, A., Binder, H., Gefeller, O. and Schmid, M. (2014) The Evolution of Boosting Algorithms. Methods of Information in Medicine, 53, 419-427. https://doi.org/10.3414/ME13-01-0122
[10]
Beutel, J., List, S. and von Schweinitz, G. (2019) Does Machine Learning Help Us Predict Banking Crises? Journal of Financial Stability, 45, Article ID: 100693. https://doi.org/10.1016/j.jfs.2019.100693
[11]
Jones, S. (2017) Corporate Bankruptcy Prediction: A High Dimensional Analysis. Review of Accounting Studies, 22, 1366-1422. https://doi.org/10.1007/s11142-017-9407-1
[12]
Kim, H., Cho, H. and Ryu, D. (2022) Corporate Bankruptcy Prediction Using Machine Learning Methodologies with a Focus on Sequential Data. Computational Economics, 59, 1231-1249. https://doi.org/10.1007/s10614-021-10126-5
[13]
Ohlson, J.A. (1980) Financial Ratios and the Probabilistic Prediction of Bankruptcy. Journal of Accounting Research, 18, 109-131. https://doi.org/10.2307/2490395
[14]
Zmijewski, M.E. (1984) Methodological Issues Related to the Estimation of Financial Distress Prediction Models. Journal of Accounting Research, 22, 59-82. https://doi.org/10.2307/2490859
[15]
Shumway, T. (2001) Forecasting Bankruptcy More Accurately: A Simple Hazard Model. Journal of Business, 74, 5-32. https://doi.org/10.1086/209665
[16]
Bonfim, D. (2009) Credit Risk Drivers: Evaluating the Contribution of Firm Level Information and of Macroeconomic Dynamics. Journal of Banking and Finance, 33, 281-299. https://doi.org/10.1016/j.jbankfin.2008.08.006
[17]
Dakovic, R., Czado, C. and Berg, D. (2010) Bankruptcy Prediction in Norway: A Comparison Study. Applied Economics Letters, 17, 1739-1746. https://doi.org/10.1080/13504850903299594
[18]
Campbell, J.Y., Hilscher, J. and Szilagyi, J. (2008) In Search of Distress Risk. Journal of Finance, 63, 2899-2939. https://doi.org/10.1111/j.1540-6261.2008.01416.x
[19]
Figlewski, S., Frydman, H. and Liang, W. (2012) Modeling the Effect of Macroeconomic Factors on Corporate Default and Credit Rating Transitions. International Re-view of Economics and Finance, 21, 87-105. https://doi.org/10.1016/j.iref.2011.05.004
[20]
Kukuk, M. and Rönnberg, M. (2013) Corporate Credit Default Models: A Mixed Logit Approach. Review of Quantitative Finance and Accounting, 40, 467-483. https://doi.org/10.1007/s11156-012-0281-4
[21]
Jessen, C. and Lando, D. (2015) Robustness of Distance-to-Default. Journal of Banking and Finance, 50, 493-505. https://doi.org/10.1016/j.jbankfin.2014.05.016
[22]
Glover, B. (2016) The Expected Cost of Default. Journal of Financial Economics, 119, 284-299. https://doi.org/10.1016/j.jfineco.2015.09.007
[23]
Kim, H., Cho, H. and Ryu, D. (2020) Corporate Default Predictions Using Machine Learning: Literature Review. Sustainability, 12, Article No. 6325. https://doi.org/10.3390/su12166325
[24]
Sun, J., Jia, M.-Y. and Li, H. (2011) AdaBoost Ensemble for Financial Distress Prediction: An Empirical Comparison with Data from Chinese Listed Companies. Expert Systems with Applications, 38, 9305-9312. https://doi.org/10.1016/j.eswa.2011.01.042
[25]
Jones, S., Johnstone, D. and Wilson, R. (2017) Predicting Corporate Bankruptcy: An Evaluation of Alternative Statistical Frameworks. Journal of Business Finance & Accounting, 44, 3-34. https://doi.org/10.1111/jbfa.12218
[26]
Barboza, F., Kimura, H. and Altman, E. (2017) Machine Learning Models and Bankruptcy Prediction. Expert Systems with Applications, 83, 405-417. https://doi.org/10.1016/j.eswa.2017.04.006
[27]
Altman, E.I. (1968) Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. Journal of Finance, 23, 589-609. https://doi.org/10.1111/j.1540-6261.1968.tb00843.x
[28]
Nam, C.W., Kim, T.S., Park, N.J. and Lee, H.K. (2008) Bankruptcy Prediction Using a Discrete-Time Duration Model Incorporating Temporal and Macroeconomic Dependencies. Journal of Forecasting, 27, 493-506. https://doi.org/10.1002/for.985
[29]
Keasey, K. and Watson, R. (1987) Non-Financial Symptoms and the Prediction of Small Company Failure: A Test of Argenti’s Hypotheses. Journal of Business Finance and Accounting, 14, 335-354. https://doi.org/10.1111/j.1468-5957.1987.tb00099.x
[30]
Yeh, C.-C., Chi, D.-J. and Lin, Y.-R. (2014) Going-Concern Prediction Using Hybrid Random Forests and Rough Set Approach. Information Sciences, 254, 98-110. https://doi.org/10.1016/j.ins.2013.07.011
[31]
Du Jardin, P. (2016) A Two-Stage Classification Technique for Bankruptcy Prediction. European Journal of Operational Research, 254, 236-252. https://doi.org/10.1016/j.ejor.2016.03.008
[32]
Karas and Režňáková, M. (2017) The Stability of Bankruptcy Predictors in the Construction and Manufacturing Industries at Various Times before Bankruptcy. EMEkonomie a Management, 20, 116-133. https://doi.org/10.15240/tul/001/2017-2-009
[33]
Bajari, P., Nekipelov, D., Ryan, S.P. and Yang, M. (2015) Machine Learning Methods for Demand Estimation. American Economic Review, 105, 481-485. https://doi.org/10.1257/aer.p20151021
[34]
Breiman, L. (2001) Random Forests. Machine Learning, 45, 5-32. https://doi.org/10.1023/A:1010933404324
[35]
Freund, Y. and Schapire, R.E. (1997) A Decision-Theoretic Generalization of Online Learning and an Application to Boosting. Journal of Computer and System Sciences, 55, 119-139. https://doi.org/10.1006/jcss.1997.1504
[36]
Akiba, T., Sano, S., Yanase, T., Ohta, T. and Koyama, M. (2019) Optuna: A Next-Generation Hyperparameter Optimization Framework. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, 4-8 August 2019, 2623-2631. https://doi.org/10.1145/3292500.3330701
[37]
Zhou, L. (2013) Performance of Corporate Bankruptcy Prediction Models on Imbalanced Dataset: The Effect of Sampling Methods. Knowledge-Based Systems, 41, 16-25. https://doi.org/10.1016/j.knosys.2012.12.007
[38]
Chava, S. and Jarrow, R.A. (2004) Bankruptcy Prediction with Industry Effects. Review of Finance, 8, 537-569. https://doi.org/10.1093/rof/8.4.537