Revisiting genome-wide association studies from statistical modelling to machine learning

Published: 2020-10-30 Issue: 4 Volume: 22
ISSN: 1467-5463
Container-title: Briefings in Bioinformatics
Language: en
Author:

Sun Shanwen¹, Dong Benzhi², Zou Quan¹

Affiliation:

1. Institute of Fundamental and Frontier Sciences at the University of Electronic Science and Technology of China, Chengdu, China

2. College of Computer Science and Engineering, Northeast Forestry University, Harbin, China

Abstract

Over the last decade, genome-wide association studies (GWAS) have discovered thousands of genetic variants underlying complex human diseases and agriculturally important traits. These findings have been used to dissect the biological basis of diseases, to develop new drugs, to advance precision medicine and to boost breeding. However, the potential of GWAS is still underexploited due to methodological limitations. Many challenges have emerged, including detecting epistasis and single-nucleotide polymorphisms (SNPs) with small effects, and distinguishing causal variants from other SNPs associated through linkage disequilibrium. These issues have motivated advancements in GWAS analyses in two contrasting cultures: statistical modelling and machine learning. In this review, we systematically present the basic concepts, benefits and limitations of both approaches. We further discuss recent efforts to mitigate their weaknesses. Additionally, we summarize the state-of-the-art tools for detecting missed signals, ultrarare mutations and gene–gene interactions, and for prioritizing SNPs. Our work can offer both theoretical and practical guidelines for performing GWAS analyses and for developing new, robust methods to fully exploit the potential of GWAS.

Funder

National Natural Science Foundation of China

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems

Link

http://academic.oup.com/bib/article-pdf/22/4/bbaa263/39129874/bbaa263.pdf

Reference: 103 articles.

1. Genome-wide association studies for common diseases and complex traits; Hirschhorn; Nat Rev Genet, 2005

2. Benefits and limitations of genome-wide association studies; Tam; Nat Rev Genet, 2019

3. Crop genome-wide association study: a harvest of biological relevance; Liu; Plant J, 2019