%0 Journal Article
%T Real-time data stream clustering based on damped window and pruning dimension tree
基于衰减窗口与剪枝维度树的实时数据流聚类*
%A ZHANG Xiao-long
%A ZENG Wei
%A
张晓龙
%A 曾伟
%J 计算机应用研究
%D 2009
%I
%X This paper proposed a novel real-time data stream clustering algorithm PDStream, which was based on damped window. PDStream firstly divided data space into grids, then used an improved dimension tree structure to maintain and update the data stream summary statistics. Designed a pruning strategy to prune the sparse grids in dimension tree periodically. Finally used the depth first search (DSF) method to deal with online clustering request. The experimental results on synthetic dataset and real dataset demonstrate that PDStream has the advantages of discovering clusters of arbitrary shape effectively, low memory consumption, preferable precision.
%K data stream
%K grid clustering
%K damped window
%K dimension tree
%K pruning strategy
数据流
%K 网格聚类
%K 衰减窗口
%K 维度树
%K 剪枝策略
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=A71BF2612A1ACE5BA8BB935F177F3F78&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=E158A972A605785F&sid=09002DF587B7129E&eid=C0C56F7E9227DF7D&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=11