GWAS-Flow: A GPU accelerated framework for efficient permutation based genome-wide association studies-Reference-Cited by-同舟云学术

GWAS-Flow: A GPU accelerated framework for efficient permutation based genome-wide association studies

Published:2019-09-27 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Freudenthal Jan A.,Ankenbrand Markus J.,Grimm Dominik G.,Korte Arthur^ORCID

Abstract

AbstractMotivationGenome-wide association studies (GWAS) are one of the most commonly used methods to detect associations between complex traits and genomic polymorphisms. As both genotyping and phenotyping of large populations has become easier, typical modern GWAS have to cope with massive amounts of data. Thus, the computational demand for these analyses grew remarkably during the last decades. This is especially true, if one wants to implement permutation-based significance thresholds, instead of using the naïve Bonferroni threshold. Permutation-based methods have the advantage to provide an adjusted multiple hypothesis correction threshold that takes the underlying phenotypic distribution into account and will thus remove the need to find the correct transformation for non Gaussian phenotypes. To enable efficient analyses of large datasets and the possibility to compute permutation-based significance thresholds, we used the machine learning framework TensorFlow to develop a linear mixed model (GWAS-Flow) that can make use of the available CPU or GPU infrastructure to decrease the time of the analyses especially for large datasets.ResultsWe were able to show that our applicationGWAS-Flowoutperforms custom GWAS scripts in terms of speed without loosing accuracy. Apart from p-values,GWAS-Flowalso computes summary statistics, such as the effect size and its standard error for each individual marker. The CPU-based version is the default choice for small data, while the GPU-based version ofGWAS-Flowis especially suited for the analyses of big data.AvailabilityGWAS-Flowis freely available on GitHub (https://github.com/Joyvalley/GWAS_Flow) and is released under the terms of the MIT-License.

Publisher

Cold Spring Harbor Laboratory

Reference26 articles.

1. Friendly rivalry

2. 1,135 Genomes Reveal the Global Pattern of Polymorphism in Arabidopsis thaliana

3. N. Siva , “1000 genomes project,” 2008.

4. The 3,000 rice genomes project: new opportunities and challenges for future rice research

5. Statistical significance for genomewide studies

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Bioinformatic method for determining single nucleotide polymorphisms on the example of gene <i>WIN</i> in <i>Glycine max</i>;Proceedings of Universities. Applied Chemistry and Biotechnology;2023-01-02

2. DNARecords: An extensible sparse format for petabyte scale genomics analysis;2022-08-15

3. Efficient Permutation-based Genome-wide Association Studies for Normal and Skewed Phenotypic Distributions;2022-04-07

4. FPGA Acceleration of GWAS Permutation Testing;2022-03-14

5. Computational approaches toward single-nucleotide polymorphism discovery and its applications in plant breeding;Bioinformatics in Agriculture;2022