Comparative study on text representation schemes in Chinese text classification
中文文本分类中的文本表示因素比较

Keywords: Chinese text classification, text representation, vectorization (中文文本分类, 文本表示, 向量化)

Abstract:

We investigate text representation methods for text classification, propose a framework for analyzing Chinese text representation algorithms, and measure how varying individual representation factors affects classification performance. Using single Chinese characters as features directly gives better results than expected. Segmenting articles with a smaller or larger dictionary, or even with a complicated segmentation algorithm, makes little difference to classification performance. Using only 0/1 values to indicate whether a feature is present in a text still gives reasonably good results. We also find that choosing reasonable vector values, for example through a suitable normalization algorithm, can greatly improve classification performance. These conclusions provide guidance for further applications.
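
As an illustration of the representation factors named in the abstract, the following minimal sketch builds character-level features (no word segmentation), a 0/1 presence vector, and a normalized frequency vector. The function names, the toy documents, and the choice of L2 normalization are assumptions made for this example; the abstract does not say which normalization algorithm or feature weighting the authors used.

```python
import math
from collections import Counter


def char_features(text):
    """Treat every non-whitespace character as one feature (no segmentation)."""
    return [ch for ch in text if not ch.isspace()]


def build_vocab(docs):
    """Map each distinct character in the corpus to a vector index."""
    chars = sorted({ch for doc in docs for ch in char_features(doc)})
    return {ch: i for i, ch in enumerate(chars)}


def binary_vector(text, vocab):
    """0/1 vector: 1.0 if the feature occurs in the text, else 0.0."""
    vec = [0.0] * len(vocab)
    for ch in set(char_features(text)):
        if ch in vocab:
            vec[vocab[ch]] = 1.0
    return vec


def normalized_tf_vector(text, vocab):
    """Raw character counts scaled to unit L2 length (assumed normalization)."""
    counts = Counter(ch for ch in char_features(text) if ch in vocab)
    vec = [0.0] * len(vocab)
    for ch, count in counts.items():
        vec[vocab[ch]] = float(count)
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


# Hypothetical toy corpus, for illustration only.
docs = ["中文文本分类", "文本表示"]
vocab = build_vocab(docs)
binary = binary_vector(docs[0], vocab)
weighted = normalized_tf_vector(docs[0], vocab)
```

Either vector can be passed to any standard classifier. Swapping `char_features` for a dictionary-based segmenter would vary the segmentation factor while leaving the vector-value factor unchanged, which is the kind of one-factor-at-a-time comparison the abstract describes.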
