Journal of Software (软件学报), 2002

A Web Document Clustering Algorithm Based on Association Rule
基于关联规则的Web文档聚类算法

Keywords: document clustering, association rule, Web mining, WWW


Abstract:

Grouping similar Web documents into clusters reduces the search space, accelerates search, and improves its precision. This paper introduces a new clustering algorithm. Topics are represented according to the vector space model (VSM), documents are represented according to topics, and the relation between documents and topics is viewed in transactional form: each document corresponds to a transaction and each topic to an item. Frequent itemsets are found with an association rule discovery algorithm, and the documents corresponding to each frequent itemset are taken as an initial cluster. These clusters are then merged according to the distance between clusters, or divided according to the strength of connection among the documents within a cluster. Experiments on real Web documents show that the algorithm is effective and well suited to handling the overlapping clusters inherent in documents.
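
The transactional view described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes topics have already been assigned to each document (the VSM step is skipped), uses a plain level-wise, Apriori-style search, and omits the merge and split steps because the abstract does not define the distance or connection-strength measures. The function name `frequent_itemsets`, the parameter `min_support`, and the sample documents are all hypothetical.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Level-wise (Apriori-style) search over document -> topics data.

    Returns a dict mapping each frequent topic set to the set of
    document ids that contain all of its topics (an initial cluster).
    """
    # Level 1: collect the documents that support each single topic.
    cover = {}
    for doc_id, topics in transactions.items():
        for topic in topics:
            cover.setdefault(frozenset([topic]), set()).add(doc_id)
    level = {s: docs for s, docs in cover.items() if len(docs) >= min_support}

    result = dict(level)
    while level:
        next_level = {}
        # Join pairs of frequent k-itemsets that differ in exactly one topic.
        for a, b in combinations(level, 2):
            candidate = a | b
            if len(candidate) != len(a) + 1 or candidate in next_level:
                continue
            # Documents containing a and b together contain the candidate.
            docs = level[a] & level[b]
            if len(docs) >= min_support:
                next_level[candidate] = docs
        result.update(next_level)
        level = next_level
    return result


# Hypothetical data: each document (transaction) lists its topics (items).
docs = {
    "d1": {"clustering", "web"},
    "d2": {"clustering", "web", "rules"},
    "d3": {"rules", "web"},
    "d4": {"clustering"},
}

clusters = frequent_itemsets(docs, min_support=2)
for topics, members in sorted(clusters.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(topics), "->", sorted(members))
```

In this toy data, document `d2` supports several frequent topic sets and therefore lands in more than one initial cluster, which is the kind of overlap the abstract says the algorithm is meant to handle.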
