Affiliation:
1. Department of Information Systems, College of Computer Science and Information Technology, King Faisal University, Al-Ahsa 31982, Saudi Arabia
Abstract
The amount of data created by individuals increases daily. These data may be gathered from various sources, such as social networks, e-commerce websites and healthcare systems, and they are frequently made available to third-party research and commercial organisations to facilitate a wide range of data studies. Although publishing data can help organisations improve their service offerings and develop new solutions that would not otherwise be available, the sensitive and confidential information contained in the datasets to be published must be protected. Over the past two decades, the research community has invested great effort in understanding how individuals’ privacy can be preserved when their data need to be published. Disassociation is a common approach for anonymising transactional data against re-identification attacks in privacy-preserving data publishing. To reduce the information loss that disassociation incurs, we proposed three new strategies for horizontal partitioning: suppression, adding and remaining list. Each strategy handles small clusters with fewer than k transactions in a different way. We used three real transactional datasets in our experiments, and our findings showed that the proposed strategies can decrease the percentage of information loss of disassociated transactional data by almost 35% compared with the original disassociation algorithm, thereby improving the utility of the published data.
Funder
Ministry of Education in Saudi Arabia
Subject
Management, Monitoring, Policy and Law; Renewable Energy, Sustainability and the Environment; Geography, Planning and Development; Building and Construction