OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

电子与信息学报 2007

Research on Automatic Text Classification Based on a Hybrid Language Model
基于一种混合语言模型的自动文本分类技术研究

Zheng De-quan,Li Sheng,Zhao Tie-jun,Yu Hao,
郑德权,李生,赵铁军,于浩

Keywords: Text classification,Ontology,Hybrid language model,Context,Multi-grams
文本分类,本体,混合语言模型,上下文,多元信息

Full-Text Cite this paper Add to My Lib

Abstract:

With the volume of information available increase, text classification has become one of the key on the Internet and corporate intranets continues to technology in organizing and processing large amount of document data. This paper gives a novel method of Chinese text categorization based on a combination of ontology with statistical method. In this study, first, linguistic ontology knowledge bank will be respectively acquired by learning training corpus for various classes to determine the various categorizations. For a actual document, the evaluation value will respectively be gotten by various linguistic ontology knowledge bank and the categorization will be judged by the highest evaluation value. This method is compared with Bayes, k-nearest neighbor and support vector machine, The primary experimental results show that the method outperforms that previous work.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Research on Automatic Text Classification Based on a Hybrid Language Model基于一种混合语言模型的自动文本分类技术研究

Research on Automatic Text Classification Based on a Hybrid Language Model
基于一种混合语言模型的自动文本分类技术研究