计算机应用研究 (Application Research of Computers), 2008
Method of multi-topic Web text classification based on VSM
Abstract:
This paper proposes a multi-topic Web text classification method based on the vector space model (VSM) with a dynamic threshold. For a given Web page, the method extracts a feature vector, computes the similarity between the page feature vector and the feature vector of each class, obtains a dynamic threshold using K-means clustering, and then selects the resulting classes. By comparing each class similarity with the dynamic threshold, the multi-topic text of a Web page is assigned to several different text classes. Simulation experiments show that the method achieves good accuracy and better recall.
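
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions the abstract does not give: raw term frequencies as feature weights, cosine similarity as the VSM similarity measure, and a one-dimensional 2-means clustering of the similarity values whose centre midpoint serves as the dynamic threshold. The function names (`tf_vector`, `cosine`, `dynamic_threshold`, `classify`) are hypothetical.

```python
import math
from collections import Counter


def tf_vector(text):
    """Term-frequency vector of whitespace-tokenised, lower-cased text (assumed weighting)."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts (assumed measure)."""
    dot = sum(w * v.get(t, 0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def dynamic_threshold(sims, iters=20):
    """Assumed scheme: 1-D K-means with k=2 over the similarity values.

    Returns the midpoint between the two cluster centres.
    """
    lo, hi = min(sims), max(sims)
    if lo == hi:
        return lo  # all similarities equal: nothing separates the classes
    for _ in range(iters):
        mid = (lo + hi) / 2
        low = [s for s in sims if s <= mid]
        high = [s for s in sims if s > mid]
        new_lo, new_hi = sum(low) / len(low), sum(high) / len(high)
        if (new_lo, new_hi) == (lo, hi):
            break  # centres stopped moving
        lo, hi = new_lo, new_hi
    return (lo + hi) / 2


def classify(page_text, class_profiles):
    """Return every class whose similarity to the page exceeds the dynamic threshold."""
    page = tf_vector(page_text)
    sims = {name: cosine(page, tf_vector(profile))
            for name, profile in class_profiles.items()}
    threshold = dynamic_threshold(list(sims.values()))
    return sorted(name for name, s in sims.items() if s > threshold)
```

Because the threshold is recomputed from each page's own similarity values, a page that matches two classes about equally well is assigned to both, whereas a single fixed cutoff would need tuning per collection. In this sketch a page whose similarities are all equal is assigned to no class.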