Affiliation:
1. University of Illinois at Urbana-Champaign
2. Bilkent University
3. Vanderbilt University
4. Ecole Polytechnique Federale de Lausanne
5. Indiana University at Bloomington
Abstract
Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to)
(i)
an association with traits and certain diseases,
(ii)
identification capability (e.g., forensics), and
(iii)
revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward.
Funder
National Science Foundation
Swiss National Science Foundation
Centre Hospitalier Universitaire Vaudois
National Institutes of Health
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Reference142 articles.
1. Order preserving encryption for numeric data
2. UK Biobank Data: Come and Get It
3. Data Re-Identification: Societal Safeguards
4. Challenges for Biomedical Informatics and Pharmacogenomics
5. Mary R. Anderlik. 2003. Assessing the quality of DNA-based parentage testing: findings from a survey of laboratories. Jurimetrics 291--314. Mary R. Anderlik. 2003. Assessing the quality of DNA-based parentage testing: findings from a survey of laboratories. Jurimetrics 291--314.
Cited by
201 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献