Personal credit risk assessment is an important part
of the development of financial enterprises. Big data credit investigation is
an inevitable trend of personal credit risk assessment, but some data are
missing and the amount of data is small, so it is difficult to train. At the
same time, for different financial platforms, we need to use different models
to train according to the characteristics of the current samples, which is
time-consuming. In view of these two
problems, this paper uses the idea of transfer learning to build a transferable
personal credit risk model based on Instance-based Transfer Learning (Instance-based
TL). The model balances the weight of the samples in the source domain, and
migrates the existing large dataset samples to the target domain of small
samples, and finds out the commonness between them. At the same time, we have
done a lot of experiments on the selection of base learners, including
traditional machine learning algorithms and ensemble learning algorithms, such
as decision tree, logistic regression, xgboostand so on. The datasets are from P2P
platform and bank, the results show that the AUC value of Instance-based TL is
24% higher than that of the traditional machine learning model, which fully
proves that the model in this paper has good application value. The model’s
evaluation uses AUC, prediction, recall, F1. These criteria prove that this
model has good application value from many aspects. At present, we are trying
to apply this model to more fields to improve the robustness and applicability
of the model; on the other hand, we are trying to do more in-depth research on
domain adaptation to enrich the model.
References
[1]
He, S., Liu, Z.D. and Ma, X.L. (2019) A Comparative Review of Credit Scoring Models—Comparison between Traditional Methods and Data Mining. Credit Reference, 37, 57-61.
[2]
Pan, S.J. and Yang, Q. (2010) A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering, 22, 1345-1359.
https://doi.org/10.1109/TKDE.2009.191
[3]
Weiss, K., Khoshgoftaar, T.M. and Wang, D.D. (2016) A Survey of Transfer Learning. Journal of Big Data, 3, Article No. 9. https://doi.org/10.1186/s40537-016-0043-6
[4]
Dai, W.Y., Yang, Q., Xue, G.-R. and Yu, Y. (2007) Boosting for Transfer Learning. Proceedings of the 24th International Conference on Machine Learning, Corvalis Oregon, June, 2007, 193-200. https://doi.org/10.1145/1273496.1273521
[5]
Cook, D., Feuz, K.D. and Krishnan, N.C. (2013) Transfer Learning for Activity Recognition: A Survey. Knowledge and Information Systems, 36, 537-556.
https://doi.org/10.1007/s10115-013-0665-3
[6]
Kermany, D.S., Goldbaum, M., Cai, W., Valentim, C.C., Liang, H., Baxter, S.L., McKeown, A., Yang, G., Wu, X., Yan, F., et al. (2018) Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell, 172, 1122-1131.E9.
https://doi.org/10.1016/j.cell.2018.02.010
[7]
Zhuang, F.-Z., Luo, P., He, Q., Shi, Z.-Z. (2015) Survey on Transfer Learning Research. Journal of Software, 26, 26-39. http://www.jos.org.cn/1000-9825/4631.html
[8]
Han, X.L. (2018) User Credit Rating Modeling Based on Xgboost. Computer Knowledge and Technology, No. 27, 7-8.
[9]
Zhu, Y., Chen, Y., Lu, Z., Pan, S.J., Xue, G.R., Yu, Y. and Yang, Q. (2011) Heterogeneous Transfer Learning for Image Classification. Proceedings of the 2011 AAAI Conference on Artificial Intelligence, San Francisco, 7-11 August 2011, 1304-1309.
[10]
Pan, W., Xiang, E.W. and Yang, Q. (2012) Transfer Learning in Collaborative Filtering with Uncertain Ratings. Proceedings of the 2012 AAAI Conference on Artificial Intelligence, Toronto, 22-26 July 2012, 662-668.
[11]
Zhu, B., He, C.-Z. and Li, H.-Y. (2015) Research on Credit Scoring Model Based on Transfer Learning, Operations Research and Management Science, No. 2, 201-207.
[12]
Zheng, L.T., Liu, G.G., Yan, C.G., Jiang, C.J., Zhou, M.C. and Li, M.Z. (2020) Improved TrAdaBoost and Its Application to Transaction Fraud Detection. IEEE Transactions on Computational Social Systems, 7, 1304-1316.
https://doi.org/10.1109/TCSS.2020.3017013
[13]
Wang, X. (2019) Application of Transfer Learning in Quantitative Stock Selection. Dissertation, South China University of Technology, Guangzhou.
[14]
Yu, Z.Z. (2012) Ensemble Transfer Learning Algorithm for Imbalanced Sample Classification. Acta Electronica Sinica, 40, 1358-1363.
[15]
Chen, T.Q. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. The 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, August 2016, 785-794.
https://doi.org/10.1145/2939672.2939785
[16]
Liu, J.C., Jiang, X.H. and Wu, J.P. (2003) Realization of a Knowledge Inference Rule Induction System. Systems Engineering, 21, 107-112.
http://dx.chinadoi.cn/10.3969/j.issn.1001-4098.2003.03.022