DAI Liu-ling,HUANG He-yan,CHEN Zhao-xiong.A comparative study on feature selection in Chinese text categorization[J].Journal of Chinese Information Processing,2004(1):26-33.(in Chinese)
[4]
YANG Yi-ming,PEDERSEN J O.A comparative study on feature selection in text categorization[C]//Proceedings of the Fourteenth International Conference on Machine Learning.Nashville:Morgan Kaufmann,1997:412-420.
[5]
ZHENG Zhao-hui,WU Xiao-yun,SRIHARI R.Feature selection for text categorization on imbalanced data[J].ACM SIGKDD Explorations Newsletter,2004,6(1):80-89.
[6]
SALTON G.Introduction to modern information retrieval[M].New York:McGraw-Hill Book Company,1983.
[7]
SHAN Song-wei,FENG Shi-cong,LI Xiao-ming.A comparative study on several typical feature selection methods for Chinese web page categorization[J].Computer Engineering and Applications,2003,22:146-148.(in Chinese)
[8]
MLADENIC D,GROBELNIK M.Feature selection for unbalanced class distribution and naive Bayes[C]//Proceedings of the Sixteenth International Conference on Machine Learning.Bled:Morgan Kaufmann,1999:258-267.
[9]
WANG Meng-yun,CAO Su-qing.The system for automatic text categorization based on Chinese character vector[J].Journal of the China Society for Scientific and Technical Information,2000,19(6):644-649.(in Chinese)
[10]
VAPNIK V.The nature of statistical learning theory[M].New York:Springer,1995.