%0 Journal Article
%T Outlier Detection and Semi-Supervised Clustering Algorithm Based on Shared Nearest Neighbors
%T 基于最近邻相似度的孤立点检测及半监督聚类算法
%A ZHENG Ling-Zhi
%A HUANG De-Cai
%A 郑灵芝
%A 黄德才
%J 计算机系统应用 (Computer Systems & Applications)
%D 2012
%X Traditional clustering analysis is unsupervised. Its precision is affected by the choice of similarity measure and by outliers in the dataset, and it does not take advantage of prior knowledge that reflects users' demands. This article therefore proposes an outlier detection and semi-supervised clustering algorithm based on shared nearest neighbors. The algorithm first detects outliers according to the number of nearest neighbors of each data point, then applies semi-supervised clustering to the dataset that has undergone outlier detection. During clustering, it incorporates expanded prior knowledge and clusters the dataset based on the principle of graph segmentation. Simulation experiments on several UCI datasets show that the algorithm detects outliers effectively and achieves good clustering performance.
%K outliers
%K shared nearest neighbors
%K semi-supervised clustering
%K prior knowledge
%K 孤立点
%K 共享最近邻
%K 半监督聚类
%K 先验知识
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D4F6864C950C88FFCE5B6C948A639E39&aid=6490A3A6E5E7C9810BF49CF1A23FC4B6&yid=99E9153A83D4CB11&vid=659D3B06EBF534A7&iid=0B39A22176CE99FB&sid=7555FB9CC973F695&eid=CDEBD1ACE0A4C1C1&journal_id=1003-3254&journal_name=计算机系统应用&referenced_num=0&reference_num=10