Random rotation for identifying differentially expressed genes with linear models following batch effect correction-Reference-Cited by-同舟云学术

Random rotation for identifying differentially expressed genes with linear models following batch effect correction

Published:2021-02-01 Issue:15 Volume:37 Page:2142-2149
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Hettegger Peter¹^ORCID,Vierlinger Klemens¹,Weinhaeusel Andreas¹

Affiliation:

1. Competence Unit Molecular Diagnostics, Health and Environment Department, Austrian Institute of Technology, Vienna 1220, Austria

Abstract

Abstract Motivation Data generated from high-throughput technologies such as sequencing, microarray and bead-chip technologies are unavoidably affected by batch effects (BEs). Large effort has been put into developing methods for correcting these effects. Often, BE correction and hypothesis testing cannot be done with one single model, but are done successively with separate models in data analysis pipelines. This potentially leads to biased P-values or false discovery rates due to the influence of BE correction on the data. Results We present a novel approach for estimating null distributions of test statistics in data analysis pipelines where BE correction is followed by linear model analysis. The approach is based on generating simulated datasets by random rotation and thereby retains the dependence structure of genes adequately. This allows estimating null distributions of dependent test statistics, and thus the calculation of resampling-based P-values and false-discovery rates following BE correction while maintaining the alpha level. Availability The described methods are implemented as randRotation package on Bioconductor: https://bioconductor.org/packages/randRotation/ Supplementary information Supplementary data are available at Bioinformatics online.

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

http://academic.oup.com/bioinformatics/advance-article-pdf/doi/10.1093/bioinformatics/btab063/38504076/btab063.pdf

Reference42 articles.

1. Permutation tests for univariate or multivariate analysis of variance and regression;Anderson;Canadian J. Fish. Aquat. Sci,2001

2. Controlling the false discovery rate: a practical and powerful approach to multiple testing;Benjamini;J. R. Stat. Soc. B,1995

3. The control of the false discovery rate in multiple testing under depencency;Benjamini;Ann. Stat,2001

4. Rotation testing in gene set enrichment analysis for small direct comparison experiments;Dørum;Stat. Appl. Genet. Mol. Biol,2009

5. Bootstrap methods: another look at the Jackknife;Efron;Ann. Stat,1979