Zhang Yufang, Peng Sh im ing, Lü Jia. Improvem ent and application o fTFIDF m ethod based on tex t classification[ J]. Computer Eng ineering, 2006, 32( 19): 76-78. ( in Chinese)
[3]
[ Sebastiani F. M ach ine learn ing in au tom ated tex t ca tego rization[ J]. ACM Computing Surveys, 2002, 34( 1): 1-47.
[4]
[ Lew is D D, Na?ve Bayes. The independence assum ption in in fo rm ation re trieval[ C ] / / The 10 th European Con f onM achine
[5]
Learning. N ew York: Springer-Verlag, 1998.
[6]
[ Y im ingY ang, X in L iu. A re-ex am ination o f text ca tego rization m e thods[ C ] / / S IGIR’ 99. New York: ACM Press, 1999: 42-49.
[7]
[ Yang Y, Chute C G. An exam ple-based mapp ingm e thod for tex t categor ization and re trieval[ J]. ACM T rans on Inform ation System s, 1994, 12( 3): 252-277.
[8]
[ H an E H, Karyp is G. Centro id-based docum ent c lassifica tion: analysis and experim enta l results[ C] / / Proc of PKDD’ 00. London: Springer-Ver lag, 2000: 424-431.
[9]
[ Schapire R E, SingerY. Im proved boosting algorithm s using confidence-rated pred ica tions[ C ] / / Proc of the 11 th Annual Conf on Computational Learn ing Theory. M adison: ACM Press, 1998: 80-91.
[10]
[ Joach im s T. Tex t categor ization w ith support vecto rm ach ines: learn ing w ith m any re levant featu res[ C ] / / The 10th European Confon Machine Learn ing. B erlin: Spr ing er, 1998: 137-142.
Xu Fengya, Luo Zhensheng. An improved approach to term we ighting in autom ated tex t classification[ J]. Com puter Eng ineering and App lica tions, 2005( 1): 181-184. ( in Ch inese)
Zhang Yuntao, Gong Ling, W ang Yong cheng. An im proved TF- IDF approach for text class ification[ J]. Journal of Zhe jiang University, 2005, 6A( 1): 49-55. ( in Ch inese)
Kou Shasha, W e i Zhenjun. Im proved w eigh ting fo rmu la in auto tex t c lassifica tion[ J]. Computer Eng ineer ing and Des ign,2005, 26( 6): 1 616-1 618. ( in Ch inese)
L iRong lu. Tex t c lassica tion system [ DB /OL ]. Data Se t, http: / /www. nlp. org. cn /docs/download. php? doc- id= 102.2004- 08- 19. ( in Chinese)
[19]
[ Dav id D, Lew is. Reuters- 21578, Test Co llections[ R /OL] . h ttp: / /www. dav iddlew is. com / resources/ testco llections/ reuters21578/. 1996.