Journal of Software (软件学报), 2006

Advances in Machine Learning Based Text Categorization

Keywords: automatic text categorization, machine learning, dimensionality reduction, kernel method, unlabeled data set, skewed data set, hierarchical categorization, large-scale text categorization, Web page categorization


Abstract:

In recent years, automatic text categorization has been studied extensively and has progressed rapidly; it is one of the most active research topics and a key technique in information retrieval and data mining. Focusing on the state-of-the-art challenges and research trends in processing Internet content and in other complex applications, this paper surveys recent developments in machine learning based text categorization, covering models, algorithms, and evaluation. It identifies nonlinearity, skewed data distribution, the labeling bottleneck, hierarchical categorization, algorithm scalability, and Web page categorization as the key problems in the study of text categorization, and discusses possible solutions to each. Finally, it outlines some directions for future research.
