|
- 2019
PERFORMANCE EVALUATION OF ENSEMBLE LEARNING ALGORITHMS ON UNBALANCED CREDIT SCORING DATA SETSKeywords: Kredi skorlama,kolektif ??renme,dengesiz veri seti Abstract: Purpose- As the credit scoring model is developed, the accuracy of the models is low due to the unbalanced distribution of the samples belonging to the classes. In this study, we tried to determine the most effective models by comparing the performance of the models obtained by using collective learning algorithms together with cost sensitive learning method. Methodology- For this purpose, Bagging and AdaBoost collective learning methods were run on two different credit data sets with decision trees, support vector machines and k-NN basic classifiers. In addition, the penal score of the minority classification group was increased by using cost-sensitive learning method for Bagging and AdaBoost. All these combinations were compared. Findings- The use of cost-sensitive learning methods has led to more successful results in terms of AUC for both AdaBoost and Bagging. It was observed that the Bagging collective method, which is the main classifier of decision trees, had higher success than the AdaBoost collective method in the case of increasing class imbalance rate in the data. Conclusion- Although the development of a highly effective credit scoring method is still a problem that needs to be solved, it has been observed that the models created by the collective learning method show higher success than the models created by individual classifiers. This situation coincides with the findings of other studies in the literature. [Maciej Zi?ba ve ark., 2012
|