%0 Journal Article
%T An Algorithm Based on RGH-Tree for Similarity Search Queries
基于广义超曲面树的相似性搜索算法
%A ZHANG Zhao-gong
%A LI Jian-zhong
%A
张兆功
%A 李建中
%J 软件学报
%D 2002
%I
%X Similarity search is a very important problem in data mining. It retrieves similar objects in database and finds proximity between objects. It can be applied to image/picture databases, spatial databases, and time-series databases. For Euclid space (a special metric space), similarity search algorithms based on R-tree are efficient in low-dimensional space, but degenerate into linear scan for high-dimensional space. This phenomenon is called dimensionality curse. This paper presents a new partition and index method of metric space, rgh-tree which distributes and partitions objects by using distance information if objects with rew fixed reference. It produces a balance tree with no data overlay. In addition, an algorithm based on rgh-tree, which is suitable for similarity search in metric space, is presented in this paper. The algorithm overcomes the shortcomings of the exiting algorithms, which has less I/O cost and times of computing distance, with average complexity o(n0.58).
%K algorithm
%K similarity search query
%K metric space
%K database
算法
%K 相似性搜索
%K 度量空间
%K 数据库
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=211474AD19C80CE6&yid=C3ACC247184A22C1&vid=FC0714F8D2EB605D&iid=F3090AE9B60B7ED1&sid=F3D9B969D1F8BD1F&eid=64CE2972A4410367&journal_id=1000-9825&journal_name=软件学报&referenced_num=2&reference_num=13