|
计算机应用研究 2012
Application of text categorization based on improved maximum entropy means clustering algorithm
|
Abstract:
In view of the traditional text classification algorithm has the problems of the characteristics having same influence on classification results,the low rate of classification accuracy,and the increasing of the algorithm time complexity,this paper presented an improved maximum entropy C-means clustering text classification methods.This method combined the C-means clustering algorithm and the maximum entropy algorithm,set Shannon entropy as a maximum entropy model in the target function,simplified classifier forms of expression,and then used the C-means clustering algorithm to the optimal features for classification.The simulation results show that,compared with traditional text classification methods,the proposed method can fast obtain the optimal classification feature subset,greatly improve the accuracy of text classification.