Association studies for next-generation sequencing-Reference-Cited by-同舟云学术

Association studies for next-generation sequencing

Published:2011-04-26 Issue:7 Volume:21 Page:1099-1108
ISSN:1088-9051
Container-title:Genome Research
language:en
Short-container-title:Genome Res.

Author:

Luo Li,Boerwinkle Eric,Xiong Momiao

Abstract

Genome-wide association studies (GWAS) have become the primary approach for identifying genes with common variants influencing complex diseases. Despite considerable progress, the common variations identified by GWAS account for only a small fraction of disease heritability and are unlikely to explain the majority of phenotypic variations of common diseases. A potential source of the missing heritability is the contribution of rare variants. Next-generation sequencing technologies will detect millions of novel rare variants, but these technologies have three defining features: identification of a large number of rare variants, a high proportion of sequence errors, and a large proportion of missing data. These features raise challenges for testing the association of rare variants with phenotypes of interest. In this study, we use a genome continuum model and functional principal components as a general principle for developing novel and powerful association analysis methods designed for resequencing data. We use simulations to calculate the type I error rates and the power of nine alternative statistics: two functional principal component analysis (FPCA)–based statistics, the multivariate principal component analysis (MPCA)–based statistic, the weighted sum (WSS), the variable-threshold (VT) method, the generalized T2, the collapsing method, the CMC method, and individual

tests. We also examined the impact of sequence errors on their type I error rates. Finally, we apply the nine statistics to the published resequencing data set from ANGPTL4 in the Dallas Heart Study. We report that FPCA-based statistics have a higher power to detect association of rare variants and a stronger ability to filter sequence errors than the other seven methods.

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics (clinical),Genetics

Reference31 articles.

1. Accurate detection and genotyping of SNPs utilizing population sequencing data

2. Statistical analysis strategies for association studies involving rare variants

3. The probability distribution of the amount of an individual's genome surviving to the following generation;Genetics,1996

4. De novo fragment assembly with short mate-paired reads: Does the read length matter?

5. Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels

Cited by 83 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Next-Generation Sequencing Data-Based Association Testing of a Group of Genetic Markers for Complex Responses Using a Generalized Linear Model Framework;Mathematics;2023-06-02

2. Association Testing of a Group of Genetic Markers Based on Next-Generation Sequencing Data and Continuous Response Using a Linear Model Framework;Mathematics;2023-03-07

3. VIVID: A Web Application for Variant Interpretation and Visualization in Multi-dimensional Analyses;Molecular Biology and Evolution;2022-09-01

4. Protein Sequencing, One Molecule at a Time;Annual Review of Biophysics;2022-05-09

5. A tree‐based gene–environment interaction analysis with rare features;Statistical Analysis and Data Mining: The ASA Data Science Journal;2022-03