Abstract
AbstractMulti-key membership testing plays a crucial role in computing systems and networking applications, encompassing web search, mail systems, distributed databases, firewalls, and network routing. Traditional approaches, such as the Bloom filter, encounter limitations within this specific context. Addressing these challenges, we propose the Multi-key Learned Bloom Filter (MLBF), a hybrid method that combines machine learning techniques with the Bloom filter. The MLBF introduces a value-interaction-based multi-key classifier and a multi-key Bloom filter. Furthermore, we introduce an Interval-based MLBF approach, which categorizes keys into specific intervals based on data distribution to minimize the False Positive Rate (FPR). Additionally, MLBF incorporates an out-of-distribution (OOD) detection component to identify data shifts. Through extensive experimental evaluations on three authentic datasets, we demonstrate the superiority of the proposed MLBF in terms of FPR and query efficiency.
Funder
National Natural Science Foundation of China
Shenzhen Municipal Science and Technology R &D Funding Basic Research Program
Municipal Government of Quzhou under Grant
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications,Artificial Intelligence,Information Systems,Software
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. LBSC: A Cost-Aware Caching Framework for Cloud Databases;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13