Return of the Lernaean Hydra-Reference-Cited by-同舟云学术

Return of the Lernaean Hydra

Published:2019-11 Issue:3 Volume:13 Page:403-420
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Echihabi Karima¹,Zoumpatianos Kostas²,Palpanas Themis³,Benbrahim Houda¹

Affiliation:

1. Mohammed V Univ.

2. Harvard University

3. Université de Paris

Abstract

Data series are a special type of multidimensional data present in numerous domains, where similarity search is a key operation that has been extensively studied in the data series literature. In parallel, the multidimensional community has studied approximate similarity search techniques. We propose a taxonomy of similarity search techniques that reconciles the terminology used in these two domains, we describe modifications to data series indexing techniques enabling them to answer approximate similarity queries with quality guarantees, and we conduct a thorough experimental evaluation to compare approximate similarity search techniques under a unified framework, on synthetic and real datasets in memory and on disk. Although data series differ from generic multidimensional vectors (series usually exhibit correlation between neighboring values), our results show that data series techniques answer approximate queries with strong guarantees and an excellent empirical performance, on data series and vectors alike. These techniques outperform the state-of-the-art approximate techniques for vectors when operating on disk, and remain competitive in memory.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/3368289.3368303

Cited by 50 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DumpyOS: A data-adaptive multi-ary index for scalable data series similarity search;The VLDB Journal;2024-08-21

2. Survey of vector database management systems;The VLDB Journal;2024-07-15

3. Beyond the Dimensions: A Structured Evaluation of Multivariate Time Series Distance Measures;2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW);2024-05-13

4. Routing-Guided Learned Product Quantization for Graph-Based Approximate Nearest Neighbor Search;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

5. CLIMBER: Pivot-Based Approximate Similarity Search Over Big Data Series;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13