全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
软件学报  2008 

Tri-Training and Data Editing Based Semi-Supervised Clustering Algorithm
基于Tri-Training和数据剪辑的半监督聚类算法

Keywords: semi-supervised clustering,semi-supervised classification,K-means,seeds set,Tri-training,depuration data editing
半监督聚类
,半监督分类,K-均值,seeds集,Tri-Training,Depuration数据剪辑

Full-Text   Cite this paper   Add to My Lib

Abstract:

In this paper, a algorithm named DE-Tri-training semi-supervised K-means is proposed, which could get a seeds set of larger scale and less noise. In detail, prior to using the seeds set to initialize cluster centroids, the training process of a semi-supervised classification approach named Tri-training is used to label unlabeled data and add them into the initial seeds set to enlarge the scale. Meanwhile, to improve the quality of the enlarged seeds set, a nearest neighbor rule based data editing technique named Depuration is introduced into Tri-training process to eliminate and correct the mislabeled noise data in the enlarged seeds. Experimental results show that the novel semi-supervised clustering algorithm could effectively improve the cluster centroids initialization and enhance clustering performance.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133