%0 Journal Article
%T Tuning of Parallel Frequent Pattern Growth Algorithm Based on Distributed Coordination System
基于分布式协调系统的并行频繁模式增长算法的优化
%A WANG Jie
%A DAI Qing-hao
%A LI Huan
%A
王洁
%A 戴清濒
%A 李环
%J 计算机科学
%D 2012
%I
%X Frequent pattern mining can find frequent pattern in data, and iYs an important step in the association rules mining. Parallel frequent pattern(PFP) algorithms apply it into parallel environment, which is suitable for massive data.Based on the implementation of Apache Mahout, this paper proposed a design for optimizing the counting and sorting parts of PFP using distributed coordination system. This design takes advantage of distributed coordination system and reduces the consumption on HDFS and memory of data node. Another benefit is that the counting procedure and sorting procedure start parallclly. At last this paper analyzed the experimental result and the difficulties for implementation for further study.
%K Frequent pattern growth algorithm
%K Parallel data mining
%K Distributed coordination system
%K Performance tuning
频繁模式增长算法,并行数据挖掘,分布式协调系统,性能优化
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=0B45337AE7EF0451317DD9015C5FC7F5&yid=99E9153A83D4CB11&vid=7C3A4C1EE6A45749&iid=38B194292C032A66&sid=0584DB487B4581F4&eid=B1F98368A47B8888&journal_id=1002-137X&journal_name=计算机科学&referenced_num=0&reference_num=0