%0 Journal Article %T GNP-based scheduling strategy for distributed crawling
基于GNP算法的分布式爬虫调度策略* %A LIU Shuang %A JIANG Chun-xiang %A ZHANG Wei-zhe %A LI Dong %A ZHANG Hong %A
刘爽 %A 姜春祥 %A 张伟哲 %A 李东 %A 张鸿 %J 计算机应用研究 %D 2010 %I %X In order to solve task scheduling and load balancing problems of distributed search engines,this paper proposed a GNP-based scheduling strategy for distributed crawling and a load balancing method.Adopted internet distance estimating mechanism as a replacement for large-scale network distance measurement,which not only improved response time of the system,but also reduced WAN pressure caused by the system.Through deploying crawling nodes at WANs,built a distributed search engine,and implemented several sche... %K distributed crawling %K scheduling strategies %K load balancing %K network measurement %K GNP(global network positioning)
分布式爬虫 %K 任务调度 %K 负载均衡 %K 网络测量 %K 全局网络定位 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=3BDE995BF121E7E8B106D5E77C9CBE16&yid=140ECF96957D60B2&vid=DB817633AA4F79B9&iid=0B39A22176CE99FB&sid=BD7D27247C63490C&eid=78BF76CF5B7CB0F2&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=9