%0 Journal Article
%T A Survey on Web Crawling
Web信息采集研究进展
%A LI Sheng-Tao YU Zhi-Hua CHENG Xue-Qi BAI Shuo
%A
李盛韬
%A 余智华
%J 计算机科学
%D 2003
%I
%X As a basic component of search engine and a series of other services on Web,Web crawler is playing an important role. Roughly,a Web crawler is a program which automatically traverses the Web by downloading documents and following links from page to page. This article detailedly explains the principles and difficulties on the Web crawler, comprehensively argues several hot directions of Web crawler,and at last views the new direction of Web crawler.
%K Web crawling
%K Web gathering
%K Search engine
%K WWW
%K Agent
Web
%K 信息采集
%K 信息发布
%K Internet
%K Intranet
%K 计算机网络
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=8FC913D2E56768E1&yid=D43C4A19B2EE3C0A&vid=340AC2BF8E7AB4FD&iid=0B39A22176CE99FB&sid=70AC2EF7F2065E09&eid=0B4F496D54044D86&journal_id=1002-137X&journal_name=计算机科学&referenced_num=15&reference_num=28