Affiliation:
1. Basic Algorithms Research Copenhagen IT University of Copenhagen, Copenhagen, Denmark
Abstract
We consider the problem of computing differentially private approximate histograms and heavy hitters in a stream of elements. In the non-private setting, this is often done using the sketch of Misra and Gries [Science of Computer Programming, 1982]. Chan, Li, Shi, and Xu [PETS 2012] describe a differentially private version of the Misra-Gries sketch, but the amount of noise it adds can be large and scales linearly with the size of the sketch; the more accurate the sketch is, the more noise this approach has to add. We present a better mechanism for releasing a Misra-Gries sketch under (ε, δ)-differential privacy. It adds noise with magnitude independent of the size of the sketch; in fact, the maximum error coming from the noise is the same as the best known in the private non-streaming setting, up to a constant factor. Our mechanism is simple and likely to be practical. In the full version of the paper we also give a simple post-processing step of the Misra-Gries sketch that does not increase the worst-case error guarantee. It is sufficient to add noise to this new sketch with less than twice the magnitude of the non-streaming setting. This improves on the previous result for "-differential privacy where the noise scales linearly to the size of the sketch.
Publisher
Association for Computing Machinery (ACM)
Reference17 articles.
1. Representing Sparse Vectors with Differential Privacy, Low Error, Optimal Space, and Fast Access
2. Differential Privacy on Finite Computers
3. R. Bassily, K. Nissim, U. Stemmer, and A. Guha Thakurta. Practical locally private heavy hitters. Advances in Neural Information Processing Systems, 30, 2017.
4. Secure Multi-party Computation of Differentially Private Heavy Hitters
5. P. Bose E. Kranakis P. Morin and Y. Tang. Bounds for frequency estimation of packet streams. In J. F. Sibeyn editor SIROCCO 10: Proceedings of the 10th Internaltional Colloquium on Structural Information Complexity June 18--20 2003 Ume°a Sweden volume 17 of Proceedings in Informatics pages 33--42. Carleton Scientific 2003.