计算机应用研究 (Application Research of Computers), 2009
Improved feature selection algorithm in spam filtering based on TF*IDF
Abstract:
With the development of networks and computers, more and more spam e-mails affect our lives. This paper first introduces the currently popular feature selection methods based on term frequency and inverse document frequency. It then compares and analyzes the various feature extraction algorithms and introduces a new feature extraction algorithm that uses an improved TF*IDF. Finally, it verifies the algorithm experimentally on the PU1 corpus. The experimental results demonstrate that the advanced naive Bayes filte...
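
For orientation, the baseline that the abstract builds on can be sketched as follows. This is a minimal illustration of the standard TF*IDF weighting (term frequency multiplied by the logarithm of the inverse document frequency) used to rank candidate features. It is not the improved variant proposed in the paper, which the abstract does not specify. The function names `tfidf_scores` and `select_features`, the summed-score ranking, and the toy messages are assumptions made for this example only.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf * idf} dict per tokenized document.

    Standard weighting (assumed here, not taken from the paper):
    tf = count / document length, idf = log(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def select_features(docs, k):
    """Keep the k terms with the highest TF*IDF summed over the corpus.

    Summing per-document scores is one simple ranking choice (hypothetical);
    ties are broken alphabetically so the result is deterministic.
    """
    totals = {}
    for doc_scores in tfidf_scores(docs):
        for term, score in doc_scores.items():
            totals[term] = totals.get(term, 0.0) + score
    ranked = sorted(totals, key=lambda term: (-totals[term], term))
    return ranked[:k]


if __name__ == "__main__":
    # Toy tokenized messages, invented for illustration.
    messages = [
        ["free", "money", "now"],
        ["meeting", "now"],
        ["free", "free", "offer"],
    ]
    print(select_features(messages, 2))
```

A term that occurs in every message receives an IDF of log(1) = 0 and therefore never ranks as a feature under this baseline, which is one of the known limitations that motivates modified weightings.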