Priors, population sizes, and power in genome-wide hypothesis tests

Author:

Cai Jitong,Zhan Jianan,Arking Dan E.,Bader Joel S.ORCID

Abstract

AbstractGenome-wide tests, including genome-wide association studies (GWAS) of germ-line genetic variants, driver tests of cancer somatic mutations, and transcriptome-wide association tests of RNA-Seq data, carry a high multiple testing burden. This burden can be overcome by enrolling larger cohorts or alleviated by using prior biological knowledge to favor some hypotheses over others. Here we compare these two methods in terms of their abilities to boost the power of hypothesis testing. We provide a quantitative estimate for progress in cohort sizes, and present a theoretical analysis of the power of oracular hard priors: priors that select a subset of hypotheses for testing, with an oracular guarantee that all true positives are within the tested subset. This theory demonstrates that for GWAS, strong priors that limit testing to 100–1000 genes provide less power than typical annual 20–40% increases in cohort sizes. These theoretical results explain the continued dominance of simple, unbiased univariate hypothesis tests for RNA-Seq studies and GWAS: if a statistical question can be answered by larger cohort sizes, it should be answered by larger cohort sizes rather than by more complicated biased methods involving priors. We suggest that priors are better suited for non-statistical aspects of biology, such as pathway structure and causality, that are not yet easily captured by standard hypothesis tests.Author summaryBiological experiments often test thousands to millions of hypotheses. Gene-based tests for human RNA-Seq data, for example, involve approximately 20,000 tests; genome-wide association studies (GWAS) involve about 1 million effective tests. A robust approach is to perform individual tests and then apply a Bonferroni correction to account for multiple testing. This approach implies a single-test p-value of 2.5 × 10−6 for RNA-Seq experiments, and a p-value of 5 × 10−8 for GWAS, to control the false-positive rate at a conventional value of 0.05. Many methods have been proposed to alleviate the multiple-testing burden by incorporating a prior probability that boosts the significance for a subset of candidate genes or variants. At the extreme limit, only hypotheses within a candidate set are tested, corresponding to a decreased multiple testing burden. Despite decades of methods development, prior-based tests have not been generally used. Here we compare the power increase possible with a prior with the power increase from a much simpler strategy of increasing a study size. We show that increasing the population size is exponentially more valuable than increasing the strength of prior, even when the true prior is known exactly. Furthermore, even modest yearly increases in actual GWAS cohorts can yield power gains beyond the reach of any reasonable prior. These results provide a rigorous explanation for the continued use of simple, robust methods rather than more sophisticated approaches. They suggest that the value of priors is not in multiple hypothesis testing but rather in non-statistical aspects of interpretation including pathway structure and causality.

Publisher

Cold Spring Harbor Laboratory

Reference18 articles.

1. The Future of Genetic Studies of Complex Human Diseases

2. Cramming more components onto integrated circuits;Electronics,1965

3. The Pace and Proliferation of Biological Technologies

4. Potential Etiologic and Functional Implications of Genome-Wide Association Loci for Human Diseases;Proceedings of the National Academy of Sciences,2009

5. All SNPs Are Not Created Equal: Genome-Wide Association Studies Reveal a Consistent Pattern of Enrichment among Functionally Annotated SNPs;PLOS Genetics,2013

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3