%0 Journal Article
%T An Attributes Correlation Based Approach for Estimating Size of Web Databases
基于属性相关度的Web数据库大小估算方法
%A LING Yan-Yan
%A MENG Xiao-Feng
%A LIU Wei
%A
凌妍妍
%A 孟小峰
%A 刘伟
%J 软件学报
%D 2008
%I
%X An approach based on the word frequency is proposed in this paper to estimate the size of Web database. It obtains a random sample on a certain attribute by analyzing the attribute correlations among all the textual attributes in the query interface. The size of a Web database can be estimated by submitting probing queries which are generated by top-k frequent words to the query interface of a Web database. The experiments on several real-world databases have proved that this approach is effective and can achieve high accuracy in estimating the size of Web databases.
%K word frequency
%K Web database size estimation
%K attributes correlation
词频
%K Web数据库大小估计
%K 属性相关度
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=B98E3644A165AAABCE8A7629D94AA003&yid=67289AFF6305E306&vid=2A8D03AD8076A2E3&iid=0B39A22176CE99FB&sid=A1266CF37D675CF1&eid=FBCA02DBD05BD4EA&journal_id=1000-9825&journal_name=软件学报&referenced_num=10&reference_num=14