全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

一种多值属性多类标数据决策树算法*

, PP. 815-820

Keywords: 分类,决策树,多值属性,多类标数据,相似度

Full-Text   Cite this paper   Add to My Lib

Abstract:

目前处理多值属性多类标数据的算法有多值多类标分类器(MMC)和多值多类标决策树(MMDT).本文在研究前面两种算法的基础上提出新的相似度计算公式sim3,并通过改进MMDT基于一致性的评定方法,提出一种处理多值属性多类标数据的算法SCC_SP,综合考虑两个多类标集合的相似性和一致性,更有利于选择最佳分裂属性.通过对比实验证明,在相同的预测机制下,SCC_SP的预测准确度比MMDT高,能更好地处理多值属性多类标数据.

References

[1]  Han Juo, Kamber M. Data Mining Concept and Techniques. Los Altos, USA: Morgan Kaufmann Publishers, 2001
[2]  Shafer J C, Agrawal R, Mehta M. SPRINT: A Scalable Parallel Classifier for Data Mining // Proc of the 22nd International Conference on Very Large Databases. Mumbai, India, 1996: 544-555
[3]  Chen Y L, Hsu C L, Chou S C. Constructing a Multi-Valued and Multi-Labeled Decision Tree. Expert Systems with Applications, 2003, 25(2): 199-209
[4]  Chou S C, Hsu C L. MMDT: A Multi-Valued and Multi-Labeled Decision Tree Classifier for Data Mining. Expert Systems with Applications, 2005, 28(2): 799-812
[5]  de Mantaras R L. A Distance-Based Attribute Selection Measure for Decision Tree Induction. Machine Learning, 1991, 5(6): 81-92
[6]  Agrawal R, Ghosh S, Imielinski T, et al. An Interval Classifier for Database Mining Applications // Proc of the 18th International Conference on Very Large Databases. Vancouver, USA, 1992: 560-573
[7]  Ruggieri S. Efficient C4.5. IEEE Trans on Knowledge and Data Engineering, 2002, 14(2): 438-444
[8]  Wang H X, Zaniolo C. CMP: A Fast Decision Tree Classifier Using Multivariate Predictions // Proc of the 16th International Conference on Data Engineering. San Diego, USA, 2000: 449-460

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133