The RLR-Tree: A Reinforcement Learning Based R-Tree for Spatial Data

Published: 2023-05-26 Issue: 1 Volume: 1 Page: 1-26
ISSN: 2836-6573
Container-title: Proceedings of the ACM on Management of Data
Language: en
Short-container-title: Proc. ACM Manag. Data

Author:

Gu Tu¹, Feng Kaiyu¹, Cong Gao¹, Long Cheng¹, Wang Zheng¹, Wang Sheng²

Affiliation:

1. Nanyang Technological University, Singapore, Singapore

2. Alibaba Group, Singapore, Singapore

Abstract

Learned indexes have been proposed to replace classic index structures like the B-Tree with machine learning (ML) models. They require replacing both the indexes and the query processing algorithms currently deployed by databases, and such a radical departure is likely to encounter challenges and obstacles. In contrast, we propose a fundamentally different way of using ML techniques to build a better R-Tree without the need to change the structure or query processing algorithms of the traditional R-Tree. Specifically, we develop reinforcement learning (RL) based models to decide how to choose a subtree for insertion and how to split a node when building and updating an R-Tree, instead of relying on the hand-crafted heuristic rules currently used by the R-Tree and its variants. Experiments on real and synthetic datasets with up to more than 100 million spatial objects show that our RL-based index outperforms the R-Tree and its variants in terms of query processing time.
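To make the "choose a subtree for insertion" decision concrete, the following is a minimal sketch of the classic hand-crafted rule from the original R-Tree (pick the child whose bounding rectangle needs the least area enlargement, breaking ties by smaller area), written so that the rule is a swappable policy. This record does not describe the paper's RL model, state features, or reward, so none of that is reproduced here; the names `Rect`, `ChoosePolicy`, `least_enlargement_policy`, and `choose_subtree` are hypothetical and only illustrate where a learned policy could plug in without changing the tree structure.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Rect:
    """Axis-aligned 2D bounding rectangle (hypothetical helper type)."""
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    def area(self) -> float:
        return (self.xmax - self.xmin) * (self.ymax - self.ymin)

    def union(self, other: "Rect") -> "Rect":
        # Smallest rectangle that covers both self and other.
        return Rect(
            min(self.xmin, other.xmin),
            min(self.ymin, other.ymin),
            max(self.xmax, other.xmax),
            max(self.ymax, other.ymax),
        )


def enlargement(mbr: Rect, obj: Rect) -> float:
    """Extra area mbr would need in order to also cover obj."""
    return mbr.union(obj).area() - mbr.area()


# A policy maps (child bounding rectangles, new object) to a child index.
# Assumption: a learned model could be wrapped in the same signature.
ChoosePolicy = Callable[[Sequence[Rect], Rect], int]


def least_enlargement_policy(children: Sequence[Rect], obj: Rect) -> int:
    """Classic heuristic: least area enlargement, ties broken by smaller area."""
    return min(
        range(len(children)),
        key=lambda i: (enlargement(children[i], obj), children[i].area()),
    )


def choose_subtree(
    children: Sequence[Rect],
    obj: Rect,
    policy: ChoosePolicy = least_enlargement_policy,
) -> int:
    """Return the index of the child to descend into when inserting obj."""
    idx = policy(children, obj)
    if not 0 <= idx < len(children):
        raise ValueError("policy returned an out-of-range child index")
    return idx


children = [Rect(0, 0, 2, 2), Rect(5, 5, 6, 6)]
chosen = choose_subtree(children, Rect(1, 1, 3, 3))
```

Because the policy only selects among existing children, the resulting tree is still an ordinary R-Tree and existing query algorithms apply unchanged, which matches the design goal stated in the abstract.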

Funder

Alibaba-NTU Singapore Joint Research Institute

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3588917

References: 45 articles.

1. The priority R-tree

2. The R*-tree: an efficient and robust access method for points and rectangles

3. A revised R*-tree in comparison with related index structures

4. Multidimensional binary search trees used for associative searching

5. Angjela Davitkova, Evica Milchevski, and Sebastian Michel. 2020. The ML-Index: A Multidimensional Learned Index for Point, Range, and Nearest-Neighbor Queries. In EDBT. 407-410.

Cited by 9 articles.

1. A Survey of Multi-Dimensional Indexes: Past and Future Trends;IEEE Transactions on Knowledge and Data Engineering;2024-08

2. The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions;Proceedings of the VLDB Endowment;2024-07

3. Machine Learning for Databases: Foundations, Paradigms, and Open problems;Companion of the 2024 International Conference on Management of Data;2024-06-09

4. Unicorn: A Unified Multi-Tasking Matching Model;ACM SIGMOD Record;2024-05-14

5. Chameleon: Towards Update-Efficient Learned Indexing for Locally Skewed Data;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13