%0 Journal Article
%T Over-sampling algorithm based on preliminary classification in imbalanced data sets learning
不均衡数据集学习中基于初分类的过抽样算法
%A HAN Hui
%A WANG Lu
%A WEN Ming
%A WANG Wen-yuan
%A
韩慧
%A 王路
%A 温明
%A 王文渊
%J 计算机应用
%D 2006
%I
%X To significantly improve the classification performance of the minority class, an over-sampling algorithm based on preliminary classification was presented. Firstly, preliminary classification was made on the test data in order to save the useful information of the majority class as much as possible, Then the test data that were predicted to belong to minority class were reclassified to improve the classification performance of the minority class. Using the data sets provided by University of California, Irvine, the new algorithm was compared with synthetic minority over-sampling technique and under-sampling method. The experimental results show that the new algorithm performs better than the others in terms of the classification performance of the minority class and majority class.
%K imbalanced data sets
%K over-sampling
%K under-sampling
不均衡数据集
%K 过抽样
%K 欠抽样
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=DA9B7FEACB25B676&yid=37904DC365DD7266&vid=96C778EE049EE47D&iid=5D311CA918CA9A03&sid=B2F4AE6815C8FC11&eid=BCA72E9D2CFA70A9&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=10