Dynamically adjusting case reporting policy to maximize privacy and public health utility in the face of a pandemic

Author:

Brown J Thomas1ORCID,Yan Chao12ORCID,Xia Weiyi1ORCID,Yin Zhijun12ORCID,Wan Zhiyu12,Gkoulalas-Divanis Aris3,Kantarcioglu Murat4,Malin Bradley A125

Affiliation:

1. Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA

2. Department of Computer Science, Vanderbilt University, Nashville, Tennessee, USA

3. IBM Watson Health, Cambridge, Massachusetts, USA

4. Department of Computer Science, University of Texas at Dallas, Dallas, Texas, USA

5. Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee, USA

Abstract

Abstract Objective Supporting public health research and the public’s situational awareness during a pandemic requires continuous dissemination of infectious disease surveillance data. Legislation, such as the Health Insurance Portability and Accountability Act of 1996 and recent state-level regulations, permits sharing deidentified person-level data; however, current deidentification approaches are limited. Namely, they are inefficient, relying on retrospective disclosure risk assessments, and do not flex with changes in infection rates or population demographics over time. In this paper, we introduce a framework to dynamically adapt deidentification for near-real time sharing of person-level surveillance data. Materials and Methods The framework leverages a simulation mechanism, capable of application at any geographic level, to forecast the reidentification risk of sharing the data under a wide range of generalization policies. The estimates inform weekly, prospective policy selection to maintain the proportion of records corresponding to a group size less than 11 (PK11) at or below 0.1. Fixing the policy at the start of each week facilitates timely dataset updates and supports sharing granular date information. We use August 2020 through October 2021 case data from Johns Hopkins University and the Centers for Disease Control and Prevention to demonstrate the framework’s effectiveness in maintaining the PK11 threshold of 0.01. Results When sharing COVID-19 county-level case data across all US counties, the framework’s approach meets the threshold for 96.2% of daily data releases, while a policy based on current deidentification techniques meets the threshold for 32.3%. Conclusion Periodically adapting the data publication policies preserves privacy while enhancing public health utility through timely updates and sharing epidemiologically critical features.

Funder

National Science Foundation and training

National Library of Medicine

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Private Continuous Survival Analysis with Distributed Multi-Site Data;2023 IEEE International Conference on Big Data (BigData);2023-12-15

2. PanDa Game: Optimized Privacy-Preserving Publishing of Individual-Level Pandemic Data Based on a Game Theoretic Model;IEEE Transactions on NanoBioscience;2023-10

3. Privacy-Preserving Publishing of Individual-Level Pandemic Data Based on a Game Theoretic Model;2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM);2022-12-06

4. How Adversarial Assumptions Influence Re-identification Risk Measures: A COVID-19 Case Study;Privacy in Statistical Databases;2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3