


New text categorization method based on the frequency of topic words

Keywords: search engine, document clustering, Fuzzy C-Means (FCM), topic word filtering


Abstract:

The word-frequency matrix currently used in text categorization is high-dimensional and excessively sparse, which makes computation difficult. To address this problem, a new text categorization method based on topic-word frequency features is proposed, drawing on the selections made by search engine users. The approach first filters new-concept topic words using a statistical method and then applies the Fuzzy C-Means (FCM) clustering algorithm to the documents, using the frequency of topic words rather than the frequency of single words as the feature. The method performed well in the experiment. It was also compared in several respects with a cluster-based text categorization method, and some useful conclusions about implementation and application were reached.
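
The clustering step described in the abstract can be illustrated with a minimal sketch of standard Fuzzy C-Means applied to a small document-by-topic-word frequency matrix. This is not the authors' implementation: the topic words, the counts, and the function name `fuzzy_c_means` are hypothetical, the statistical topic-word filtering step is assumed to have already been done, and the fuzzifier `m = 2.0` is a common default rather than a value taken from the paper.

```python
import random

def fuzzy_c_means(X, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard FCM on the rows of X; returns (centers, memberships)."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    # Random initial memberships, each row normalised to sum to 1.
    U = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        s = sum(row)
        U.append([u / s for u in row])
    centers = []
    for _ in range(iters):
        # Centre update: mean of the documents weighted by u_ik ** m.
        centers = []
        for k in range(c):
            w = [U[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * X[i][j] for i in range(n)) / total for j in range(d)]
            )
        # Membership update: u_ik proportional to dist_ik ** (-2 / (m - 1)).
        new_U = []
        for i in range(n):
            dist = [
                max(sum((X[i][j] - centers[k][j]) ** 2 for j in range(d)) ** 0.5, 1e-12)
                for k in range(c)
            ]
            inv = [dk ** (-2.0 / (m - 1.0)) for dk in dist]
            s = sum(inv)
            new_U.append([v / s for v in inv])
        change = max(abs(new_U[i][k] - U[i][k]) for i in range(n) for k in range(c))
        U = new_U
        if change < tol:
            break
    return centers, U

# Hypothetical topic words (columns) and made-up counts per document (rows).
topic_words = ["search engine", "neural network", "stock market"]
docs = [
    [5, 0, 0],
    [4, 1, 0],
    [0, 0, 6],
    [0, 1, 5],
]
centers, U = fuzzy_c_means(docs, c=2)
# Hard label per document: the cluster with the highest membership.
labels = [max(range(2), key=lambda k: row[k]) for row in U]
```

Because each column is a filtered topic word rather than a single word, the feature matrix has far fewer columns than a full vocabulary matrix, which is the dimensionality and sparsity reduction the abstract refers to. The membership rows in `U` each sum to 1, so a document can belong partly to several topics.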

