%0 Journal Article
%T Website Crawling for Specific Topics
领域相关的Web网站抓取方法
%A LI Gang
%A ZHOU Li-Zhu
%A GUO Qi
%A LIN Ling
%A
李刚
%A 周立柱
%A 郭奇
%A 林玲
%J 计算机科学
%D 2007
%I
%X In this paper, we propose a new approach to discover the Websites for special topic in WWW with high precision and low cost. This approach improves traditional Focused Crawler techniques, different from the common Web crawler which accesses the Web graph composed by HTML pages and hyperlinks, our crawler uses Meta-Seareh to get the URLs of relevant page, then uses heuristic search method to reduce the search cost, and uses topic relevant rules to increase the precision. The experimental results show the presented approach is both effective and efficient.
%K Meta-Search
%K Focused crawler
%K Heuristic search
Meta-Search
%K 聚焦爬虫(Focused
%K Crawler)
%K 启发式搜索
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=B75FD199B1D41B3BBEE024C5412F13B8&yid=A732AF04DDA03BB3&vid=339D79302DF62549&iid=0B39A22176CE99FB&sid=205BE674D84A456D&eid=B0EBA60720995721&journal_id=1002-137X&journal_name=计算机科学&referenced_num=1&reference_num=10