Optimizing murine sample sizes for RNA-seq studies revealed from large-scale comparative analysis

Author:

Halasz Gabor,Schmahl Jennifer,Negron Nicole,Ni Min,Atwal Gurinder S.,Glass David J.ORCID

Abstract

ABSTRACTDetermining the appropriate sample size (N) for comparative biological experiments is critical for obtaining reliable results. In order to determine the N, the usual approach is to perform a power calculation, which involves understanding the variability between samples and the expected effect size. Here, we focused on bulk RNA-seq experiments, which have become ubiquitous in biology, but which have many unknown or difficult to estimate parameters, and so the required analyses to determine the minimum N is typically lacking. We therefore performed two N=30 profiling studies between wild-type mice and mice in which one copy of a gene had been deleted, to determine how many mice would be required to minimize false positives and to maximize true discoveries found in the N of 30 experiment. Results from experiments with N=4 or less are shown to be highly misleading, given the substantial false positive rate, and the lack of discovery of genes later found with higher N. For a cut-off of 2-fold expression differences, we found that an N of 6-7 mice was required to consistently decrease the false positive rate to below 50%, and that “more is always better” when it came to discovery rates - an N of 8-12 is significantly better in lowering the false positive rate.A common method to reduce false discovery rate in underpowered experiments is to raise the fold cutoff or increase the stringency of the P-value and include only highly perturbed, highly significant genes. We show that while this strategy is no substitute for increasing the N of the experiment, because it results in consistently inflated effect sizes and a substantial drop in sensitivity. These data should be helpful to others in choosing their Ns, since it’s often not practical to do such large studies for every mouse model.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3