Similarity Search in the Blink of an Eye with Compressed Indices

Authors:

Cecilia Aguerrebere¹, Ishwar Singh Bhati², Mark Hildebrand², Mariano Tepper², Theodore Willke²

Affiliations:

1. Intel Labs, Santa Clara, California

2. Intel Labs, Hillsboro, Oregon

Abstract

Nowadays, data is represented by vectors. Retrieving those vectors, among millions and billions, that are similar to a given query is a ubiquitous problem, known as similarity search, of relevance for a wide range of applications. Graph-based indices are currently the best performing techniques for billion-scale similarity search. However, their random-access memory pattern presents challenges to realize their full potential. In this work, we present new techniques and systems for creating faster and smaller graph-based indices. To this end, we introduce a novel vector compression method, Locally-adaptive Vector Quantization (LVQ), that uses per-vector scaling and scalar quantization to improve search performance with fast similarity computations and a reduced effective bandwidth, while decreasing memory footprint and barely impacting accuracy. LVQ, when combined with a new high-performance computing system for graph-based similarity search, establishes the new state of the art in terms of performance and memory footprint. For billions of vectors, LVQ outcompetes the second-best alternatives: (1) in the low-memory regime, by up to 20.7x in throughput with up to a 3x memory footprint reduction, and (2) in the high-throughput regime by 5.8x with 1.4x less memory.

Publisher

Association for Computing Machinery (ACM)

Cited by 5 articles.

1. RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search. Proceedings of the ACM on Management of Data, 2024-05-29.

2. Reinforcement Learning Infused MAC for Adaptive Connectivity. 2024 IEEE Wireless Communications and Networking Conference (WCNC), 2024-04-21.

3. Optimizing Resource Utilization Using Vector Databases in Green Internet of Things. 2023 IEEE Globecom Workshops (GC Wkshps), 2023-12-04.

4. Lossy Compression Options for Dense Index Retention. Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023-11-26.

5. Similarity Search in the Blink of an Eye with Compressed Indices. Proceedings of the VLDB Endowment, 2023-07.
