Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2007

Algorithms for Mining the N Most Frequent Itemsets*

pp. 512-518

Chen Xiaoyun, Hu Yunfa

Keywords: data mining, N most frequent itemsets, support threshold, inverted matrix


Abstract:

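The computational complexity of frequent itemset mining algorithms, and the number of frequent itemsets they generate, grow exponentially with the number of items in the transaction set, so the minimum support threshold is the key to controlling this growth. In practice, however, a support threshold alone rarely gives effective control over the size of the result. The paper therefore defines the problem of mining the N most frequent itemsets and proposes two algorithms for it, both based on a strategy of dynamically adjusting the support threshold: NApriori, a breadth-first search algorithm, and IntvMatrix, a depth-first search algorithm. Experiments show that both methods are more than twice as efficient as the naive approach, and that the advantage is most pronounced when N is small.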
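The abstract gives no pseudocode, so the following is only a minimal sketch of the general idea of a dynamically raised support threshold in a breadth-first (Apriori-style) search. It is not the authors' NApriori or IntvMatrix. The function name `top_n_frequent_itemsets`, the min-heap bookkeeping, and the sample transactions are all assumptions made for illustration. The threshold starts at 1 and, once N itemsets have been found, is raised to the support of the N-th best itemset seen so far; any candidate below that value cannot reach the top N, and neither can its supersets.

```python
import heapq
from itertools import combinations


def top_n_frequent_itemsets(transactions, n):
    """Return up to n (itemset, support) pairs with the highest support.

    Illustrative sketch only: ties at the N-th position are broken arbitrarily.
    """
    if n <= 0:
        return []
    transactions = [frozenset(t) for t in transactions]
    heap = []       # min-heap of (support, sorted item tuple): best n so far
    threshold = 1   # dynamic minimum support, only ever raised

    # Level 1 candidates are all single items.
    candidates = {frozenset([item]) for t in transactions for item in t}
    k = 1
    while candidates:
        level = {}
        for cand in candidates:
            support = sum(1 for t in transactions if cand <= t)
            if support < threshold:
                continue  # cannot be among the top n, nor can its supersets
            level[cand] = support
            entry = (support, tuple(sorted(cand)))
            if len(heap) < n:
                heapq.heappush(heap, entry)
            elif support > heap[0][0]:
                heapq.heapreplace(heap, entry)
            if len(heap) == n:
                threshold = heap[0][0]  # raise threshold to the n-th best support

        # Re-filter with the final threshold of this level, then join survivors.
        survivors = {s for s, sup in level.items() if sup >= threshold}
        candidates = set()
        for a, b in combinations(survivors, 2):
            union = a | b
            # Apriori pruning: every k-subset of a (k+1)-candidate must survive.
            if len(union) == k + 1 and all(
                frozenset(sub) in survivors for sub in combinations(union, k)
            ):
                candidates.add(union)
        k += 1

    return sorted(((items, sup) for sup, items in heap),
                  key=lambda pair: (-pair[1], pair[0]))


# Hypothetical sample data, not taken from the paper.
transactions = [
    {"bread", "milk"},
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"bread"},
    {"milk", "eggs"},
]
print(top_n_frequent_itemsets(transactions, 3))
# [(('bread',), 4), (('milk',), 4), (('bread', 'milk'), 3)]
```

In this sample the threshold rises from 1 to 2 after the single items are counted and to 3 after the pairs, so the only three-item candidate is never generated. A naive alternative would mine with a fixed low threshold and sort afterwards, which is the kind of baseline the abstract compares against.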

