Affiliation:
1. Department of Computer Engineering & Science, Case Western Reserve University
Abstract
In many database applications, one of the common queries is to find approximate matches to a given query item from a collection of data items. For example, given an image database, one may want to retrieve all images that are similar to a given query image. Distance based index structures are proposed for applications where the data domain is high dimensional, or the distance function used to compute distances between data objects is non-Euclidean. In this paper, we introduce a distance based index structure called multi-vantage point (mvp) tree for similarity queries on high-dimensional metric spaces. The mvp-tree uses more than one vantage point to partition the space into spherical cuts at each level. It also utilizes the pre-computed (at construction time) distances between the data points and the vantage points. We have done experiments to compare mvp-trees with vp-trees which have a similar partitioning strategy, but use only one vantage point at each level, and do not make use of the pre-computed distances. Empirical studies show that mvp-tree outperforms the vp-tree 20% to 80% for varying query ranges and different distance distributions.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems,Software
Cited by
98 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications;Journal of Big Data;2024-08-11
2. GTS: GPU-based Tree Index for Fast Similarity Search;Proceedings of the ACM on Management of Data;2024-05-29
3. HJG: An Effective Hierarchical Joint Graph for ANNS in Multi-Metric Spaces;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
4. SCORE: Scalable Contact Tracing over Uncertain Trajectories;Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering;2024
5. Design of Dyadic Gabor Wavelet Filter Banks;Feature Extraction in Medical Image Retrieval;2024