Journal of Computer Applications (计算机应用), 2006
New effective method for spam filtering
Abstract:
A new, effective method for spam filtering based on the principle of granularity is presented. First, the method divides the spam class and the legitimate class in the training corpus into four smaller classes, from which four center vectors are obtained. In keeping with the principle of granularity, this smaller granularity is used to describe the knowledge in the training corpus. When filtering, a new e-mail is compared with each of the four center vectors to decide which class it belongs to. The method was tested on a spam corpus and compared with k-nearest neighbors (KNN). The results show that the new method has several advantages, including high accuracy and high filtering speed.
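
The procedure described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two classes are split into four, how e-mails are turned into vectors, or which similarity measure is used, so everything below is an assumption made for illustration: each class is split into two subclasses with a small 2-means pass, e-mails are term-frequency vectors over a fixed vocabulary, and cosine similarity is the comparison. The function names (`vectorize`, `split_in_two`, `train`, `classify`) and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def vectorize(text, vocab):
    """Term-frequency vector over a fixed vocabulary (assumed representation)."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocab]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def split_in_two(vectors, iterations=10):
    """Assumed splitting step: 2-means with cosine similarity.

    Seeds are the first vector and the vector least similar to it.
    Returns the two subclass center vectors.
    """
    first = vectors[0]
    second = min(vectors, key=lambda v: cosine(first, v))
    centers = [first, second]
    for _ in range(iterations):
        groups = [[], []]
        for v in vectors:
            nearest = 0 if cosine(v, centers[0]) >= cosine(v, centers[1]) else 1
            groups[nearest].append(v)
        # Keep the old center if a group ends up empty.
        centers = [centroid(g) if g else c for g, c in zip(groups, centers)]
    return centers


def train(spam_texts, legit_texts, vocab):
    """Build four (label, center vector) pairs: two per original class."""
    model = []
    for label, texts in (("spam", spam_texts), ("legit", legit_texts)):
        vectors = [vectorize(t, vocab) for t in texts]
        for center in split_in_two(vectors):
            model.append((label, center))
    return model


def classify(text, model, vocab):
    """Assign the parent label of the most similar center vector."""
    v = vectorize(text, vocab)
    label, _ = max(model, key=lambda item: cosine(v, item[1]))
    return label


# Toy data, invented for this sketch only.
vocab = ["free", "money", "pills", "cheap", "meeting", "report", "lunch", "friday"]
spam = ["free money free money", "cheap pills cheap pills",
        "free money now", "cheap pills online"]
legit = ["meeting report due", "lunch friday",
         "report for the meeting", "friday lunch plans"]

model = train(spam, legit, vocab)
print(classify("free cheap money", model, vocab))
print(classify("friday meeting report", model, vocab))
```

Under these assumptions, classifying a message costs four similarity computations regardless of corpus size, whereas KNN compares the message against every training e-mail. That difference is one plausible reading of the filtering-speed advantage the abstract reports.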