Affiliation:
1. Chungbuk National University
Abstract
Locality-sensitive hashing is a technique to allow approximate nearest search for large volume of data in a fast manner. Binary code locality-sensitive hashing distributes a data set into buckets labeled with binary code, where binary codes are determined by a set of hash functions. The binary hash codes play the role of partitioning the data space into subspaces. When close neighbors are placed around subspace boundaries, there are chances to fail in locating them. It requires to check neighboring buckets while finding nearest ones. The paper presents a technique to enhance the search performance by introducing the notion of extended boundary. It reduces the potential misses and the search overhead especially for the regions located at the double-napped corners. Keywords: locality sensitive hashing, data search, hashing, data analysis
Publisher
Trans Tech Publications, Ltd.
Reference5 articles.
1. P. Indyk and R. Motwani, Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality, Proc. of STOC1998 (1998).
2. K. M. Lee, Locality-sensitive Hashing Techniques for Nearest Neighbor Search, Int. J. of Fuzzy Logic and Intell. Syst., 12(4) (2012).
3. M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni, Locality-sensitive Hashing Scheme based on p-stable Distribution, Symp. on Computational Geometry, p.253–262 (2004).
4. R. R. Salakhutdinov, G.E. Hinton, Semantic hashing, proc. of SIGIR 2007 (2007).
5. A. Gionis, P. Indyk, and R. Motwani, Similarity Search in High Dimensions via Hashing, Proc. of VLDB1999 (1999).
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献