计算机科学 (Computer Science), 2008
Two-class Text Categorization Method Based on Naive Bayes and GA
Abstract:
A two-class text categorization method based on Naive Bayes and a genetic algorithm (GA) is proposed. It recasts Naive Bayesian classification as a search for a dividing line that fits the distribution of the text data set in a constructed two-dimensional space. A genetic algorithm is used to search for an optimal dividing line in the region where texts are unreliably classified. Experimental results on a Chinese text data set of 12600 texts show that the method achieves good performance and efficiency. The precision, recall and F1 are 97.98%, 9...
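
The idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the two-dimensional space is built, so this sketch assumes each text is mapped to a point (x, y) whose coordinates are its Naive Bayes log-scores under class 1 and class 2; under that assumption plain Naive Bayes corresponds to the line y = x. The function names (`fitness`, `unreliable`, `ga_search`, `make_toy_points`), the synthetic data, and the GA operators (truncation selection, slope/intercept crossover, Gaussian mutation) are all hypothetical choices for illustration, not the paper's actual design.

```python
import random

def fitness(line, points):
    """Fraction of labelled points on the correct side of y = a*x + b."""
    a, b = line
    correct = 0
    for x, y, label in points:
        predicted = 1 if y < a * x + b else 2  # below the line -> class 1
        correct += predicted == label
    return correct / len(points)

def unreliable(points, margin):
    """Assumed 'unreliable area': points whose two scores differ by at most margin."""
    return [p for p in points if abs(p[0] - p[1]) <= margin]

def ga_search(points, generations=60, pop_size=30, seed=0):
    """Search for a line (a, b) that maximizes fitness on the given points."""
    rng = random.Random(seed)
    # Seed the population with the Naive Bayes line (a=1, b=0) plus random lines.
    population = [(1.0, 0.0)] + [
        (1.0 + rng.uniform(-0.5, 0.5), rng.uniform(-2.0, 2.0))
        for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        population.sort(key=lambda ln: fitness(ln, points), reverse=True)
        parents = population[: pop_size // 2]  # keep the better half (elitism)
        children = []
        while len(parents) + len(children) < pop_size:
            p, q = rng.sample(parents, 2)
            # Crossover: slope from one parent, intercept from the other,
            # followed by a small Gaussian mutation of both genes.
            children.append((p[0] + rng.gauss(0, 0.05), q[1] + rng.gauss(0, 0.2)))
        population = parents + children
    return max(population, key=lambda ln: fitness(ln, points))

def make_toy_points(n, seed=0):
    """Synthetic (x, y, label) points; NOT the paper's data set."""
    rng = random.Random(seed)
    points = []
    for i in range(n):
        label = 1 if i % 2 == 0 else 2
        x = rng.uniform(-50.0, -10.0)  # pretend log-score under class 1
        gap = rng.gauss(0.0, 1.0) if label == 1 else rng.gauss(3.0, 1.0)
        points.append((x, x + gap, label))
    return points

if __name__ == "__main__":
    pts = make_toy_points(200, seed=1)
    near = unreliable(pts, margin=4.0)
    best = ga_search(near)
    print("Naive Bayes line accuracy:", fitness((1.0, 0.0), near))
    print("GA line", best, "accuracy:", fitness(best, near))
```

Because the Naive Bayes line is placed in the initial population and the better half of each generation is carried over unchanged, the line returned by this sketch never scores worse on the searched points than y = x does.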