Privacy preservation by disassociation-Reference-Cited by-同舟云学术

Privacy preservation by disassociation

Published:2012-06 Issue:10 Volume:5 Page:944-955
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Terrovitis Manolis¹,Mamoulis Nikos²,Liagouris John³,Skiadopoulos Spiros⁴

Affiliation:

1. IMIS, Research Center 'Athena'

2. Univ. of Hong Kong

3. NTUA

4. Univ. of Peloponnese

Abstract

In this work, we focus on protection against identity disclosure in the publication of sparse multidimensional data. Existing multidimensional anonymization techniques (a) protect the privacy of users either by altering the set of quasi-identifiers of the original data (e.g., by generalization or suppression) or by adding noise (e.g., using differential privacy) and/or (b) assume a clear distinction between sensitive and non-sensitive information and sever the possible linkage. In many real world applications the above techniques are not applicable. For instance, consider web search query logs. Suppressing or generalizing anonymization methods would remove the most valuable information in the dataset: the original query terms. Additionally, web search query logs contain millions of query terms which cannot be categorized as sensitive or non-sensitive since a term may be sensitive for a user and non-sensitive for another. Motivated by this observation, we propose an anonymization technique termed disassociation that preserves the original terms but hides the fact that two or more different terms appear in the same record. We protect the users' privacy by disassociating record terms that participate in identifying combinations. This way the adversary cannot associate with high probability a record with a rare combination of terms. To the best of our knowledge, our proposal is the first to employ such a technique to provide protection against identity disclosure . We propose an anonymization algorithm based on our approach and evaluate its performance on real and synthetic datasets, comparing it against other state-of-the-art methods based on generalization and differential privacy.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/2336664.2336668

Cited by 66 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A divide-and-conquer approach to privacy-preserving high-dimensional big data release;Journal of Information Security and Applications;2024-06

2. Efficient Multi-Source Anonymity for Aggregated Internet of Vehicles Datasets;Applied Sciences;2024-04-11

3. Preserving Individual Privacy from Inference Attack in Transaction Data Publishing;2023 Eighth International Conference on Informatics and Computing (ICIC);2023-12-08

4. A New Approach for Anonymizing Transaction Data with Set Values;Electronics;2023-07-12

5. An Improved Partitioning Method via Disassociation towards Environmental Sustainability;Sustainability;2023-04-30