Affiliation:
1. Renmin University of China
2. Hong Kong Polytechnic University
Abstract
Answering range queries under Local Differential Privacy (LDP) is a widely studied problem in Online Analytical Processing (OLAP). Existing LDP solutions all assume that data is uniformly distributed within each domain partition. Real-world data distributions vary, so this assumption often does not hold and leads to inaccurate estimates. To address this problem, we introduce PriPL-Tree, a novel data structure that combines hierarchical tree structures with piecewise linear (PL) functions to answer range queries for arbitrary distributions. PriPL-Tree models the underlying data distribution with a few line segments, leading to more accurate results for range queries. Furthermore, we extend it to multi-dimensional cases with novel data-aware adaptive grids. These grids use the marginal distributions obtained through PriPL-Trees to partition the space adaptively, adapting to the density of the underlying distributions. Our extensive experiments on both real and synthetic datasets demonstrate the effectiveness and superiority of PriPL-Tree over state-of-the-art solutions in answering range queries across arbitrary data distributions.
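The difference between the uniform assumption and a line-segment model can be illustrated with a minimal sketch. It is not the PriPL-Tree algorithm: it assumes a continuous domain, a single partition [a, b], a known density of the form slope * x + intercept, and no LDP noise. The function names and parameters are hypothetical and chosen only for this illustration.

```python
def uniform_estimate(lo, hi, a, b, mass):
    """Estimate the mass in [lo, hi], assuming a flat density on the partition [a, b]."""
    lo, hi = max(lo, a), min(hi, b)  # clip the query to the partition
    if hi <= lo:
        return 0.0
    return mass * (hi - lo) / (b - a)


def linear_estimate(lo, hi, a, b, slope, intercept):
    """Estimate the mass in [lo, hi] by integrating f(x) = slope * x + intercept."""
    lo, hi = max(lo, a), min(hi, b)  # clip the query to the partition
    if hi <= lo:
        return 0.0
    # Closed-form integral of a line segment over [lo, hi].
    return 0.5 * slope * (hi ** 2 - lo ** 2) + intercept * (hi - lo)


# Hypothetical skewed density f(x) = 2x on [0, 1], total mass 1.
# The true mass in [0, 0.5] is 0.25.
print(uniform_estimate(0.0, 0.5, 0.0, 1.0, mass=1.0))               # 0.5
print(linear_estimate(0.0, 0.5, 0.0, 1.0, slope=2.0, intercept=0.0))  # 0.25
```

For this skewed density the flat model doubles the true answer, while the single line segment recovers it exactly, which is the intuition behind replacing per-partition uniformity with piecewise linear functions.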
Publisher
Association for Computing Machinery (ACM)