Mining frequent patterns with differential privacy-Reference-Cited by-同舟云学术

Mining frequent patterns with differential privacy

Published:2013-08-28 Issue:12 Volume:6 Page:1422-1427
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Bonomi Luca¹,Xiong Li¹

Affiliation:

1. Department of Mathematics & Computer Science, Emory University, Atlanta

Abstract

The mining of frequent patterns is a fundamental component in many data mining tasks. A considerable amount of research on this problem has led to a wide series of efficient and scalable algorithms for mining frequent patterns. However, releasing these patterns is posing concerns on the privacy of the users participating in the data. Indeed the information from the patterns can be linked with a large amount of data available from other sources creating opportunities for adversaries to break the individual privacy of the users and disclose sensitive information. In this proposal, we study the mining of frequent patterns in a privacy preserving setting. We first investigate the difference between sequential and itemset patterns, and second we extend the definition of patterns by considering the absence and presence of noise in the data. This leads us in distinguishing the patterns between exact and noisy. For exact patterns, we describe two novel mining techniques that we previously developed. The first approach has been applied in a privacy preserving record linkage setting, where our solution is used to mine frequent patterns which are employed in a secure transformation procedure to link records that are similar. The second approach improves the mining utility results using a two-phase strategy which allows to effectively mine frequent substrings as well as prefixes patterns. For noisy patterns, first we formally define the patterns according to the type of noise and second we provide a set of potential applications that require the mining of these patterns. We conclude the paper by stating the challenges in this new setting and possible future research directions.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/2536274.2536329

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Histogram Publishing Method under Differential Privacy That Involves Balancing Small-Bin Availability First;Algorithms;2024-07-04

2. Privacy Amplification via Shuffling: Unified, Simplified, and Tightened;Proceedings of the VLDB Endowment;2024-04

3. Privacy-Enhanced Frequent Sequence Mining and Retrieval for Personalized Behavior Prediction;IEEE Transactions on Information Forensics and Security;2024

4. Novel FDP mechanisms for releasing bipartite graph data on fixed and infinite intervals;Journal of Intelligent & Fuzzy Systems;2023-04-03

5. Fuzzy Differential Privacy Theory and Its Applications in Subgraph Counting;IEEE Transactions on Fuzzy Systems;2023-02