全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

A C4.5 Decision Tree Based Algorithm for Web Pages Categorization
一种基于C4.5决策树的Web页面分类算法

Keywords: web text categorization,C4,5 decision tree,information theory,information gain ratio,web crawler
WEB文本分类
,C4.5决策树,信息论,信息增益率,网络爬虫

Full-Text   Cite this paper   Add to My Lib

Abstract:

Web text categorization can be applied to many domains such as information retrieval, news categorization, etc. Decision tree algorithm is a simple method for categorization and has been used extensively. This paper investigates the basic method and process to build a web classifier by means of C4.5 decision tree, which has various merits such as high categorization precision, high categorization speed, etc. Moreover, this paper proposes a C4.5 decision tree based frame of web pages classifier, and implements it on a web crawler. The experimental results show that this algorithm is highly effective.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133