%0 Journal Article %T An Attributes Correlation Based Approach for Estimating Size of Web Databases
基于属性相关度的Web数据库大小估算方法 %A LING Yan-Yan %A MENG Xiao-Feng %A LIU Wei %A
凌妍妍 %A 孟小峰 %A 刘伟 %J 软件学报 %D 2008 %I %X An approach based on the word frequency is proposed in this paper to estimate the size of Web database. It obtains a random sample on a certain attribute by analyzing the attribute correlations among all the textual attributes in the query interface. The size of a Web database can be estimated by submitting probing queries which are generated by top-k frequent words to the query interface of a Web database. The experiments on several real-world databases have proved that this approach is effective and can achieve high accuracy in estimating the size of Web databases. %K word frequency %K Web database size estimation %K attributes correlation
词频 %K Web数据库大小估计 %K 属性相关度 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=B98E3644A165AAABCE8A7629D94AA003&yid=67289AFF6305E306&vid=2A8D03AD8076A2E3&iid=0B39A22176CE99FB&sid=A1266CF37D675CF1&eid=FBCA02DBD05BD4EA&journal_id=1000-9825&journal_name=软件学报&referenced_num=10&reference_num=14