Computing Power and Sample Size for the False Discovery Rate in Multiple Applications

Published: 2024-03-07 Volume: 15 Issue: 3 Page: 344
ISSN: 2073-4425
Container-title: Genes
Language: en
Short-container-title: Genes

Author:

Ni Yonghui¹, Seffernick Anna Eames¹, Onar-Thomas Arzu¹, Pounds Stanley B.¹

Affiliation:

1. Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA

Abstract

The false discovery rate (FDR) is a widely used metric of statistical significance for genomic data analyses that involve multiple hypothesis testing. Power and sample size considerations are important in planning studies that perform these types of genomic data analyses. Here, we propose a three-rectangle approximation of a p-value histogram to derive a formula to compute the statistical power and sample size for analyses that involve the FDR. We also introduce the R package FDRsamplesize2, which incorporates these and other power calculation formulas to compute power for a broad variety of studies not covered by other FDR power calculation software. A few illustrative examples are provided. The FDRsamplesize2 package is available on CRAN.
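To make the idea concrete, the following is a minimal sketch in Python, not the FDRsamplesize2 implementation. It assumes the commonly used mixture-model relationship FDR ≈ π0·α / (π0·α + (1 − π0)·power), where π0 is the proportion of true null hypotheses and α is the per-test p-value cutoff; whether this matches the paper's formula term by term is an assumption. The function names, the two-sample z-test, and the common effect size are all hypothetical choices made for illustration.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, used for quantiles and tail areas


def pvalue_threshold(fdr, pi0, avg_power):
    """Per-test p-value cutoff alpha that solves
    pi0*alpha / (pi0*alpha + (1 - pi0)*avg_power) = fdr  (assumed relationship)."""
    alpha = fdr * (1.0 - pi0) * avg_power / (pi0 * (1.0 - fdr))
    if not 0.0 < alpha < 1.0:
        raise ValueError("inputs do not give a cutoff strictly between 0 and 1")
    return alpha


def z_test_power(n_per_group, effect_size, alpha):
    """Power of a two-sided, two-sample z-test with n_per_group samples in each
    group and a standardized mean difference of effect_size (illustrative test)."""
    z_crit = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    shift = effect_size * (n_per_group / 2.0) ** 0.5
    upper_tail = 1.0 - _STD_NORMAL.cdf(z_crit - shift)
    lower_tail = _STD_NORMAL.cdf(-z_crit - shift)
    return upper_tail + lower_tail


def sample_size(fdr, pi0, target_power, effect_size, n_max=100_000):
    """Smallest per-group n whose per-test power at the FDR-derived cutoff
    reaches target_power. Returns (n, alpha)."""
    alpha = pvalue_threshold(fdr, pi0, target_power)
    for n in range(2, n_max + 1):
        if z_test_power(n, effect_size, alpha) >= target_power:
            return n, alpha
    raise ValueError("target power not reached within n_max")


if __name__ == "__main__":
    # Hypothetical planning scenario: 95% true nulls, 5% FDR, 80% average power.
    n, alpha = sample_size(fdr=0.05, pi0=0.95, target_power=0.80, effect_size=1.0)
    print(f"per-test cutoff: {alpha:.5f}, samples per group: {n}")
```

The sketch separates the two steps: the FDR target is first converted into a fixed per-test cutoff, and an ordinary single-test power calculation is then run at that cutoff. Any test with a known power function could replace the z-test in the second step.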

Funder

American Lebanese Syrian Associated Charities

Publisher

MDPI AG

Link

https://www.mdpi.com/2073-4425/15/3/344/pdf
