全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
软件学报  2003 

Constructing Binary Classification Trees with High Intelligibility
具有高可理解性的二分决策树生成算法研究

Keywords: machine learning,binary classification tree,information gain,pruning,range-splitting based on continuous attributes transform (RCAT) algorithm
机器学习
,二分决策树,信息熵增益,剪枝,RCAT算法

Full-Text   Cite this paper   Add to My Lib

Abstract:

Binarization is the most popular discretization method in decision tree generation, while for the domain with many continuous attributes, it always gets a big incomprehensible tree which can't be described as knowledge. In order to get a more intelligible decision tree, this paper presents a new discretization algorithm, RCAT, for continuous attributes in the generation of binary classification tree. It uses simple binarization to solve the multisplitting problem through mapping a continuous attribute into another probability attribute based on statistic information. Two pruning methods are introduced to simplify the constructed tree. Empirical results of several domains show that, for the two-class problem with a preponderance of continuous attributes, RCAT algorithm can generate a much smaller decision tree efficiently with higher intelligibility than binarization while retaining predictive accuracy.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133