%0 Journal Article
%T Hybrid algorithm for classification of unbalanced datasets
%T 不平衡数据分类的混合算法
%A HAN Min
%A ZHU Xin-rong
%A 韩敏
%A 朱新荣
%J 控制理论与应用
%D 2011
%X A novel hybrid algorithm that integrates radial basis function neural networks (RBFNN) with the random forest algorithm is proposed to improve the poor results that traditional algorithms produce when classifying the minority class of unbalanced datasets. First, random interpolations are inserted between adjacent samples in the minority class to balance the data distribution. Features whose receiver operating characteristic (ROC) confidence level is below 95% are treated as redundant and deleted. The input data are then perturbed by the Bagging technique, and an RBFNN serves as the base classifier in the random forest. The individual decisions are fused and the output is determined by majority voting. The method is applied to UCI datasets. The G-mean and the area under the ROC curve demonstrate improved classification accuracy on medium-size and large-size unbalanced datasets.
%K imbalanced data
%K random forest
%K radial basis function neural network(RBFNN)
%K receiver operator characteristics(ROC)
%K 不平衡数据
%K 随机森林
%K 径向基函数神经网络
%K 受试者特征曲线
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=970898A57DFC021F93AB51667BAED7F7&aid=0AAC9CFC9B630DAADB59D0C7890850A3&yid=9377ED8094509821&vid=D3E34374A0D77D7F&iid=F3090AE9B60B7ED1&sid=44B95CDA8EBD6F56&eid=1254F6F9A8625D48&journal_id=1000-8152&journal_name=控制理论与应用&referenced_num=0&reference_num=18