Author:
Kim Miran, Wang Su, Jiang Xiaoqian, Harmanci Arif
Abstract
Background: Sequencing of thousands of samples provides genetic variants with allele frequencies spanning a very large spectrum and gives invaluable insight into the genetic determinants of diseases. Protecting the genetic privacy of participants is challenging, as only a few rare variants can easily re-identify an individual among millions. In certain cases, there are policy barriers against sharing genetic data from indigenous populations and data related to stigmatizing conditions.
Results: We present SVAT, a method for secure outsourcing of variant annotation and aggregation, which are two basic steps in variant interpretation and detection of causal variants. SVAT uses homomorphic encryption to encrypt the data on the client side. The data always stays encrypted while it is stored, in transit, and, most importantly, while it is analyzed. SVAT uses a vectorized data representation to convert annotation and aggregation into efficient vectorized operations in a single framework. SVAT also uses a secure re-encryption approach so that multiple disparate genotype datasets can be combined for federated aggregation and secure computation of allele frequencies on the aggregated dataset.
Conclusions: Overall, SVAT provides a secure, flexible, and practical framework for privacy-aware outsourcing of annotation, filtering, and aggregation of genetic variants. SVAT is publicly available for download from https://github.com/harmancilab/SVAT
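To make the idea of aggregation as a vectorized operation on encrypted data concrete, here is a minimal sketch. It is not SVAT's implementation: the abstract does not name the homomorphic scheme or the vector layout, so this example substitutes a toy Paillier cryptosystem (additively homomorphic) with deliberately tiny, insecure parameters, and assumes one vector slot per variant position holding an alternate-allele count. The names `encrypt`, `decrypt`, `add_encrypted`, `site_a`, and `site_b` are hypothetical. The re-encryption step that lets datasets under different keys be combined is not shown; both sites share one key here.

```python
import math
import random

# Toy Paillier parameters. These primes are far too small for real use.
P, Q = 1009, 1013
N = P * Q
N2 = N * N
LAM = (P - 1) * (Q - 1) // math.gcd(P - 1, Q - 1)  # lcm(P-1, Q-1)
MU = pow(LAM, -1, N)  # valid because the generator is fixed to N + 1


def encrypt(m):
    """Encrypt a small non-negative integer m < N."""
    while True:
        r = random.randrange(1, N)
        if math.gcd(r, N) == 1:
            break
    return (pow(N + 1, m, N2) * pow(r, N, N2)) % N2


def decrypt(c):
    """Recover the plaintext from ciphertext c."""
    return ((pow(c, LAM, N2) - 1) // N * MU) % N


def add_encrypted(a, b):
    """Slot-wise homomorphic addition: multiplying ciphertexts adds plaintexts."""
    return [(x * y) % N2 for x, y in zip(a, b)]


# Assumed layout: one slot per variant position, value = alternate-allele count.
site_a = [0, 1, 2, 0]
site_b = [1, 1, 0, 0]

# The server only ever sees ciphertexts; the key holder decrypts the totals.
enc_total = add_encrypted([encrypt(v) for v in site_a],
                          [encrypt(v) for v in site_b])
allele_counts = [decrypt(c) for c in enc_total]
```

Dividing each decrypted count by twice the number of pooled samples would give the allele frequency at that position, assuming diploid genotypes.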
Publisher
Cold Spring Harbor Laboratory