Credit card companies must be able to identify fraudulent transactions so that customers are not charged for items they did not purchase. Many machine learning approaches and classifiers have previously been used to detect fraudulent transactions. However, because fraud patterns change constantly, it is increasingly important to study new kinds of fraud and to update models to reflect the new patterns. The purpose of this research is to build a machine learning classifier that correctly identifies both fraudulent and legitimate transactions. The model should therefore achieve high accuracy, precision, recall, and F1-score. In this study, we started from a large dataset and evaluated four machine learning classifiers: Support Vector Machine (SVM), Decision Tree, Naïve Bayes, and Random Forest. In the experiments, the Random Forest classifier achieved an overall accuracy of 99.96%, together with the best precision, recall, F1-score, and Matthews correlation coefficient.
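The evaluation measures named above can all be derived from the four cells of a binary confusion matrix. The following is a minimal sketch of those standard formulas, not the study's own evaluation code. The function name `classification_metrics` and the transaction counts are hypothetical and chosen only for illustration. It suggests why accuracy alone is a weak indicator when fraud is rare, and why precision, recall, F1-score, and the Matthews correlation coefficient are reported alongside it.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Return standard binary-classification metrics from confusion-matrix counts.

    tp: fraud correctly flagged, fp: legitimate flagged as fraud,
    tn: legitimate correctly passed, fn: fraud that was missed.
    Ratios with a zero denominator are reported as 0.0.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical data: 10,000 transactions, of which 20 are fraudulent.
# A model that never flags fraud still looks accurate, but recall and MCC are zero.
never_flags = classification_metrics(tp=0, fp=0, tn=9980, fn=20)

# A model that catches 16 of the 20 frauds with 4 false alarms.
catches_most = classification_metrics(tp=16, fp=4, tn=9976, fn=4)

for name, metrics in [("never flags", never_flags), ("catches most", catches_most)]:
    print(name, {k: round(v, 4) for k, v in metrics.items()})
```

With these illustrative counts the two models differ by only about a tenth of a percentage point in accuracy, while the remaining four metrics separate them clearly.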