%0 Journal Article
%T Application of Improved K-Means Algorithm to Analysis of Online Public Opinions
改进的K-means算法在网络舆情分析中的应用
%A TANG Han-Qing
%A WANG Han-Jun
%A
汤寒青
%A 王汉军
%J 计算机系统应用
%D 2011
%I
%X Combining background application requirement of online public opinion analysis, this paper firstly introduces the processing of text information, and then discusses the K-means algorithm of the text clustering, according to its characteristic that clustering results depend on the centers of initial clustering, and improves it. Based on the thought that text title can express its content, the improved algorithm uses sparse character vector to express text title, calculates the sparse similarity of them and ascertains the centers of initial clustering. The experiments show that the method improves the clustering accuracy. Compared with another algorithm based on the principle of maximum and minimum distance, the improved method heightens the efficiency and ensures the clustering accuracy.
%K online public opinion
%K K-means clustering algorithm
%K text clustering
%K sparse character vector
网络舆情
%K K-means
%K 算法
%K 文本聚类
%K 稀疏特征向量
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D4F6864C950C88FFCE5B6C948A639E39&aid=92FD002CEC06ADE70BED7E596EA04864&yid=9377ED8094509821&vid=A04140E723CB732E&iid=38B194292C032A66&sid=31611641D4BB139F&eid=A9415C81E6459A69&journal_id=1003-3254&journal_name=计算机系统应用&referenced_num=0&reference_num=10