%0 Journal Article
%T Using Capture-Recapture approach estimate size of Web databases
用Capture-Recapture方法估计Web数据库大小*
%A MIAO Zhong-yi
%A HU Peng-yu
%A CUI Zhi-ming
%A
苗忠义
%A 胡鹏昱
%A 崔志明
%J 计算机应用研究
%D 2009
%I
%X In order to estimate the size of Web database, this paper proposed the Capture-Recapture based estimation methods that filtered out two words intimate and rejection cases. Submitting attributed high-frequency words in the text box of query interface, using the returned result, in the intersection of two results analyzing the independence of two sampling, filtering the dependent couples, and then using Capture-Recapture method estimated the size of Web database. In the simulated and real environment for the experiment, the bias and the volatility of the method are smaller.
%K size estimation
%K Deep Web
%K Web database
大小估计
%K 深网
%K 网络数据库
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=C1B112C4CED4CDF08CBFD644EF46F5CB&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=94C357A881DFC066&sid=7820732DED23DCED&eid=181DAA2DD1AE90C6&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=11