Privacy-Preserving Data Sharing for Genome-Wide Association Studies
-
Published:2013-08-01
Issue:1
Volume:5
Page:
-
ISSN:2575-8527
-
Container-title:Journal of Privacy and Confidentiality
-
language:
-
Short-container-title:JPC
Author:
Uhler CarolineORCID,
Slavkovic Aleksandra B.,
Fienberg Stephen E.ORCID
Abstract
Traditional statistical methods for confidentiality protection of statistical databases do not scale well to deal with GWAS databases especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach which provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, we propose new methods to release aggregate GWAS data without compromising an individual’s privacy. We present methods for releasing differentially private minor allele frequencies, chi-square statistics and p-values. We compare these approaches on simulated data and on a GWAS study of canine hair length involving 685 dogs. We also propose a privacy-preserving method for finding genome-wide associations based on a differentially-private approach to penalized logistic regression.
Funder
National Science Foundation
National Center for Research Resources
National Center for Advancing Translational Sciences
Publisher
Journal of Privacy and Confidentiality
Subject
Computer Science Applications,Statistics and Probability,Computer Science (miscellaneous)
Cited by
25 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献