%0 Journal Article
%T A Cluster Validity Function Based on Geometric Probability
一种基于几何概率的聚类有效性函数
%A LI Xiao-Wen
%A MAO Zheng-Yuan
%A LI Jian-Wei
%A
李晓雯
%A 毛政元
%A 李建微
%J 中国图象图形学报
%D 2008
%I
%X Determining optimum cluster number is a key research topic included in cluster validity,a fundamental unsolved problem in cluster analysis.In order to determine the optimum cluster number,this article proposes a new cluster validity function for two dimensional datasets theoretically based on geometric probability.The function uses of the relationship between a two dimensional dataset and the corresponding two dimensional discrete point set to measure the cluster structure of the dataset according to the distributive feature of the point set in the characteristic space.It is designed from the perspective of intuition and thus can be easily understood.During the process of measurement,the structure information of the point set has been stored in a line segment set generated by connecting each pair points in the point set.The cluster validity function is formed by comparing the values of line segment direction in the line segment set with those resulted from completely random condition.In the case study,it is testified that the pattern of the function curve generated with a given example dataset effectively enables the determination of the optimum cluster number of the dataset and supports the design of cluster algorithms.
%K Cluster validity
%K Geometric probability
%K Cluster analysis
%K The optimum cluster number
聚类有效性
%K 几何概率
%K 聚类分析
%K 最佳聚类数
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D06194629680C940ACE75262F54B9D85&aid=FA9CA1C5CFE3657C545F5A73D2B82F45&yid=67289AFF6305E306&vid=FC0714F8D2EB605D&iid=59906B3B2830C2C5&sid=50B9C3FF1B4615D6&eid=DE73B43A18DEAEE3&journal_id=1006-8961&journal_name=中国图象图形学报&referenced_num=0&reference_num=16