On the Asymptotic Capacity of Information-Theoretic Privacy-Preserving Epidemiological Data Collection
Author:
Cheng Jiale1ORCID, Liu Nan1ORCID, Kang Wei2
Affiliation:
1. National Mobile Communications Research Laboratory, Southeast University, Nanjing 211189, China 2. School of Information Science and Engineering, Southeast University, Nanjing 211189, China
Abstract
The paradigm-shifting developments of cryptography and information theory have focused on the privacy of data-sharing systems, such as epidemiological studies, where agencies are collecting far more personal data than they need, causing intrusions on patients’ privacy. To study the capability of the data collection while protecting privacy from an information theory perspective, we formulate a new distributed multiparty computation problem called privacy-preserving epidemiological data collection. In our setting, a data collector requires a linear combination of K users’ data through a storage system consisting of N servers. Privacy needs to be protected when the users, servers, and data collector do not trust each other. For the users, any data are required to be protected from up to E colluding servers; for the servers, any more information than the desired linear combination cannot be leaked to the data collector; and for the data collector, any single server can not know anything about the coefficients of the linear combination. Our goal is to find the optimal collection rate, which is defined as the ratio of the size of the user’s message to the total size of downloads from N servers to the data collector. For achievability, we propose an asymptotic capacity-achieving scheme when E<N−1, by applying the cross-subspace alignment method to our construction; for the converse, we proved an upper bound of the asymptotic rate for all achievable schemes when E<N−1. Additionally, we show that a positive asymptotic capacity is not possible when E≥N−1. The results of the achievability and converse meet when the number of users goes to infinity, yielding the asymptotic capacity. Our work broadens current researches on data privacy in information theory and gives the best achievable asymptotic performance that any epidemiological data collector can obtain.
Funder
National Natural Science Foundation of China Research Fund of National Mobile Communications Research Laboratory, Southeast University
Subject
General Physics and Astronomy
Reference33 articles.
1. Kim, J., and Kwon, O. (2021). A Model for Rapid Selection and COVID-19 Prediction with Dynamic and Imbalanced Data. Sustainability, 13. 2. Olson, D., Lamb, M., Lopez, M.R., Colborn, K., Paniagua-Avila, A., Zacarias, A., Zambrano-Perilla, R., Rodríguez-Castro, S.R., Cordon-Rosales, C., and Asturias, E.J. (2017). Performance of a Mobile Phone App-Based Participatory Syndromic Surveillance System for Acute Febrile Illness and Acute Gastroenteritis in Rural Guatemala. J. Med. Internet Res., 19. 3. An Ecological Momentary Assessment of Primiparous Women’s Breastfeeding Behavior and Problems From Birth to 8 Weeks;Demirci;J. Hum. Lact.,2017 4. Silva de Lima, A.L., Hahn, T., Evers, L.J.W., de Vries, N.M., Cohen, E., Afek, M., Bataille, L., Daeschler, M., Claes, K., and Boroojerdi, B. (2017). Feasibility of large-scale deployment of multiple wearable sensors in Parkinson’s disease. PLoS ONE, 12. 5. COVID-19, digital privacy, and the social limits on data-focused public health responses;Fahey;Int. J. Inf. Manag.,2020
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|