|
计算机应用 2005
An automatic text classifier of class-based feature selection algorithm
|
Abstract:
Current feature selection algorithms are all based on term frequency,and ignore the class information in the training sample set.A new feature selection algorithm based on class information was put forward.The principle of the algorithm is as follows: according to the similarity difference caused by whether or not a word existed in a document,the discriminative power with that this word distinguished different documents could be determined.And then,the discriminative power was taken as the importance for