Abstract
AbstractMotivationGenome-wide association interaction studies (GWAIS) are becoming increasingly important as estimates of genetic interactions at the genome-wide level using genome-wide data from hundreds of thousands of individuals from large biobanks showed that non-additive genetic variance plays a role in complex human traits in addition to additive genetic effects identified in genome-wide association studies (GWAS). However, a comprehensive genome-wide search for all combinations of second-order (SNPxSNP) or third-order (SNPxSNPxSNP) associations using millions of SNP markers is a very computationally intensive task, especially when hundreds of thousands or, in the near future, even millions of individuals can be studied with GWAS datasets. The runtime so far exceeds years, even if the search is performed on a multicore CPU server system.ResultsWe developedGWAIS-Web, a web service for fast analysis of genome-wide interactions with case-control GWAS datasets. By using a hybrid combination of graphics-processing units (GPUs) and field-programmable gate arrays (FPGAs),GWAIS-Webspeeds up epistasis detection methods for binary traits by a factor of more than 2000, allowing an exhaustive SNP-SNP GWAIS with a GWAS data set of one million SNPs and 500,000 individuals to complete within one day, which would take more than five years on a regular CPU server system. The user can choose between different methods for epistasis detection, such as logistic regression, BOOST, mutual information (MI) and others, with calculations in double precision and including on-the-fly filtering of correlated results based on linkage disequilibrium (LD). Due to the underlying common data structure ofGWAIS-Web, all methods can be combined and processed together on-the-fly without increasing the runtime. The user can choose between 2nd order (pairwise) and 3rd order tests and can also limit the search to selected chromosomal regions.GWAIS-Weboffers a high level of security through optional 2-factor authentication, encrypted connections and the protection of GWAS/user account data in accordance with the European General Data Protection Regulation (GDPR).AvailabilityGWAIS-Webis freely available athttps://hybridcomputing.ikmb.uni-kiel.de. The stand-alone softwareHybridGWAIScan be downloaded athttps://github.com/ikmb/hybridgwais.Supplementary informationSupplementary data are available online.
Publisher
Cold Spring Harbor Laboratory
Reference21 articles.
1. Calle, M. L. , Urrea, V. , Malats, N. and van Steen, K. (2007). MB-MDR: Model-Based Multifactor Dimension Reduction for detecting interactions in high-dimensional genomic data, Ann. Hum. Genet. 75.
2. SmileFinder: a resampling-based approach to evaluate signatures of selection from genome-wide sets of matching allele frequency data in two or more diploid populations
3. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits
4. Federal Trade Commission (2012). Protecting consumer privacy in an era of rapid change. https://www.ftc.gov/sites/default/files/documents/reports/federal-trade-commission-report-protecting-consumer-privacy-era-rapid-change-recommendations/120326privacyreport.pdf.
5. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1