Survey on Privacy-Preserving Techniques for Microdata Publication-Reference-Cited by-同舟云学术

Survey on Privacy-Preserving Techniques for Microdata Publication

Published:2023-07-17 Issue:14s Volume:55 Page:1-42
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Carvalho Tânia¹^ORCID,Moniz Nuno²^ORCID,Faria Pedro³^ORCID,Antunes Luís¹^ORCID

Affiliation:

1. University of Porto

2. INESC TEC/University of Porto

3. TekPrivacy

Abstract

The exponential growth of collected, processed, and shared microdata has given rise to concerns about individuals’ privacy. As a result, laws and regulations have emerged to control what organisations do with microdata and how they protect it. Statistical Disclosure Control seeks to reduce the risk of confidential information disclosure by de-identifying them. Such de-identification is guaranteed through privacy-preserving techniques (PPTs). However, de-identified data usually results in loss of information, with a possible impact on data analysis precision and model predictive performance. The main goal is to protect the individual’s privacy while maintaining the interpretability of the data (i.e., its usefulness). Statistical Disclosure Control is an area that is expanding and needs to be explored since there is still no solution that guarantees optimal privacy and utility. This survey focuses on all steps of the de-identification process. We present existing PPTs used in microdata de-identification, privacy measures suitable for several disclosure types, and information loss and predictive performance measures. In this survey, we discuss the main challenges raised by privacy constraints, describe the main approaches to handle these obstacles, review the taxonomies of PPTs, provide a theoretical analysis of existing comparative studies, and raise multiple open issues.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3588765

Reference257 articles.

1. Security-control methods for statistical databases: A comparative study;Adam Nabil R.;ACM Computing Surveys,1989

2. Aircloak GmbH. 2021. Aircloak. Retrieved November 1 2021 from https://aircloak.com/.

3. An efficient approach for publishing microdata for multiple sensitive attributes;Anjum Adeel;Journal of Supercomputing,2018

4. Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein generative adversarial networks. In Proceedings of the International Conference on Machine Learning. 214–223.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. FedGKD: Federated Graph Knowledge Distillation for privacy-preserving rumor detection;Knowledge-Based Systems;2024-11

2. Privacy-preserving generation and publication of synthetic trajectory microdata: A comprehensive survey;Journal of Network and Computer Applications;2024-10

3. A survey on privacy-preserving control and filtering of networked control systems;International Journal of Systems Science;2024-04-30

4. Assessing the Potentials of LLMs and GANs as State-of-the-Art Tabular Synthetic Data Generation Methods;Lecture Notes in Computer Science;2024

5. Synthetic Data Outliers: Navigating Identity Disclosure;Lecture Notes in Computer Science;2024