%0 Journal Article
%T Multi-domain deep Web crawler based on most efficient queries
基于最优查询的多领域deep Web爬虫*
%A FENG Ming-yuan
%A LIN Huai-zhong
%A
冯明远
%A 林怀忠
%J 计算机应用研究
%D 2009
%I
%X Deep Web information can only be obtained through queries submitted to search forms in pages. While traditional hyperlinks based search engines were hard to index the deep Web data. To address this problem, proposed a most efficient queries based on deep Web crawler. It generated the most efficient queries through clustered Web pages, submitted the queries, and indexed the returned results. Experiment shows it can crawl data automatically and efficiently from multi-domain deep Web.
%K deep Web
deep
%K Web
%K deep
%K Web爬虫
%K 最优查询
%K 页面聚类
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=2612FBFFC4ED64B16753E43FA541EAC7&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=9CF7A0430CBB2DFD&sid=9F5513BCB1BF5DFF&eid=53FFBE35995BC10A&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=12