%0 Journal Article
%T GNP-based scheduling strategy for distributed crawling
%T 基于GNP算法的分布式爬虫调度策略
%A LIU Shuang
%A JIANG Chun-xiang
%A ZHANG Wei-zhe
%A LI Dong
%A ZHANG Hong
%A 刘爽
%A 姜春祥
%A 张伟哲
%A 李东
%A 张鸿
%J 计算机应用研究
%D 2010
%X To address task scheduling and load balancing in distributed search engines, this paper proposes a GNP-based scheduling strategy for distributed crawling, together with a load balancing method. An Internet distance estimation mechanism replaces large-scale network distance measurement, which both improves the system's response time and reduces the WAN load the system generates. A distributed search engine was built by deploying crawling nodes across WANs, and implemented several sche...
%K distributed crawling
%K scheduling strategies
%K load balancing
%K network measurement
%K GNP (global network positioning)
%K 分布式爬虫
%K 任务调度
%K 负载均衡
%K 网络测量
%K 全局网络定位
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=3BDE995BF121E7E8B106D5E77C9CBE16&yid=140ECF96957D60B2&vid=DB817633AA4F79B9&iid=0B39A22176CE99FB&sid=BD7D27247C63490C&eid=78BF76CF5B7CB0F2&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=9