|
计算机应用研究 2008
Similarity computing of documents based on VSM
|
Abstract:
The precision and efficiency of the computing of documents similarity is the foundation and key of other documents process.This paper improved the DF and TF-IDF arithmetic.In this way,DF's time complexity was linearity that suited the mass documents process,and could make up the fault that exceptional useful characters might be deleted.Also,it did a mend on the TF-IDF arithmetic to improve the precision of documents similarity.