%0 Journal Article %T Web Key Resource Page Judgment Based on Improved Decision Tree Algorithm
基于改进决策树算法的网络关键资源页面判定 %A LIU Yi-Qun %A ZHANG Min %A MA Shao-Ping %A
刘奕群 %A 张敏 %A 马少平 %J 软件学报 %D 2005 %I %X Key resource page is one of the most important search target pages for Web search users. Decision tree learning is one of the most widely-used and practical methods for inductive inference in machine learning. Because of the difficulty in uniform sampling of Web pages, there are not enough negative instances for training a key resource decision tree. To solve the problem, the original algorithm is partly modified to learn from global instead of individual instance information. With the same evaluation method as TREC (Text Retrieval Conference) 2003, large scale retrieval experiments based on improved decision tree algorithm achieves more than 40% improvement than the ones based on the original algorithm. It not only offers an effective way for selecting Web key resource pages, but also shows a possible way to improve decision tree learning performances. %K Web information retrieval %K key resource page %K machine learning %K decision tree
网络信息检索 %K 关键资源页面 %K 机器学习 %K 决策树 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=8BD0B0D05A3F8885&yid=2DD7160C83D0ACED&vid=7801E6FC5AE9020C&iid=708DD6B15D2464E8&sid=D1DB94C1649032D3&eid=C7EA1836BA194C9B&journal_id=1000-9825&journal_name=软件学报&referenced_num=10&reference_num=18