全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Study on New Pretreatment Method for Chinese Text Classification System
文本自动分类系统文本预处理方法的研究

Keywords: Text Classification,Text Pretreatment,Stop-words,Chinese Term
文本分类
,文本预处理,停用词,中文分词

Full-Text   Cite this paper   Add to My Lib

Abstract:

Presents a new text pretreatment method that applying programme flows control to eliminate the single Chinese word, pure English words, number and Chinese words containing English letter or maths symbol from the original text vector. Consequently the features that represent the text turn into the pure Chinese term. As a result, not only dimension of original text vector is deduced greatly but the information contents of text vector are improved enormously.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133