%0 Journal Article
%T Modified K-means algorithm based on new cluster validity index<br>基于新聚类有效性函数的改进K-means算法
%A SUN Xiu-juan
%A LIU Xi-yu
%A <br>孙秀娟
%A 刘希玉
%J 计算机应用
%D 2008
%I 
%X The class number k is one of the key factors to influence cluster quality in K-means algorithm. Several cluster validity measures have been proposed for confirming the optimal k value. However, the existing methods may not work well for the following two kinds of data sets: the data set containing cluster groups with different densities and the data set in which the cluster groups are extremely close to each other. Therefore, a new cluster validity index was proposed. The index was defined as the ratio value between the squared total length of the data eigen-axes and the between-cluster separation (the data set containing merged cluster group). If the value reaches the minimum, the clustering number is the optimal one. At the same time, in order to reduce the sensitivity of K-means algorithm to isolation point and noise, a K-wmeans clustering algorithm based on weights was put forward to calculate clustering centers. Experimental results show that the proposed algorithm gives more accurate results than the other algorithm. A modified K-means algorithm based on a new cluster validity index not only reduces the impact of isolation point and noise but also effectively deals with the two kinds of data sets mentioned above, improving the quality of data clustering.
%K clustering
%K k-means algorithm
%K cluster validity<br>聚类
%K K-means算法
%K 聚类有效性
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=1CE690EE00F5901A4E219321B5CFEE74&yid=67289AFF6305E306&vid=D3E34374A0D77D7F&iid=59906B3B2830C2C5&sid=5F2FEFD7AAE27FB3&eid=991752DE8BE844DD&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=15