全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Tuning of Parallel Frequent Pattern Growth Algorithm Based on Distributed Coordination System
基于分布式协调系统的并行频繁模式增长算法的优化

Keywords: Frequent pattern growth algorithm,Parallel data mining,Distributed coordination system,Performance tuning
频繁模式增长算法,并行数据挖掘,分布式协调系统,性能优化

Full-Text   Cite this paper   Add to My Lib

Abstract:

Frequent pattern mining can find frequent pattern in data, and iYs an important step in the association rules mining. Parallel frequent pattern(PFP) algorithms apply it into parallel environment, which is suitable for massive data.Based on the implementation of Apache Mahout, this paper proposed a design for optimizing the counting and sorting parts of PFP using distributed coordination system. This design takes advantage of distributed coordination system and reduces the consumption on HDFS and memory of data node. Another benefit is that the counting procedure and sorting procedure start parallclly. At last this paper analyzed the experimental result and the difficulties for implementation for further study.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133