Batch effect reduction of microarray data with dependent samples using an empirical Bayes approach (BRIDGE)
Author:
Xia Qing (1), Thompson Jeffrey A. (1), Koestler Devin C. (1)
Affiliation:
1. Department of Biostatistics & Data Science, University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
Abstract
Batch effects present challenges in the analysis of high-throughput molecular data and are particularly problematic in longitudinal studies when interest lies in identifying genes/features whose expression changes over time, but time is confounded with batch. While many methods to correct for batch effects exist, most assume independence across samples, an assumption that is unlikely to hold in longitudinal microarray studies. We propose Batch effect Reduction of mIcroarray data with Dependent samples usinG Empirical Bayes (BRIDGE), a three-step parametric empirical Bayes approach that leverages technical replicate samples profiled at multiple timepoints/batches, so-called “bridge samples”, to inform batch-effect reduction/attenuation in longitudinal microarray studies. Extensive simulation studies and an analysis of a real biological data set were conducted to benchmark the performance of BRIDGE against both ComBat and longitudinal ComBat. Our results demonstrate that while all methods perform well in facilitating accurate estimates of time effects, BRIDGE outperforms both ComBat and longitudinal ComBat in the removal of batch effects in data sets with bridging samples and, perhaps as a result, was observed to have improved statistical power for detecting genes with a time effect. BRIDGE demonstrated competitive performance in batch-effect reduction of confounded longitudinal microarray studies, in both simulated and real data sets, and may serve as a useful preprocessing method for researchers conducting longitudinal microarray studies that include bridging samples.
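To illustrate why bridge samples are informative when time is confounded with batch, the following minimal Python sketch estimates a per-gene additive batch shift from technical replicates measured in both batches and subtracts it from the second batch. It is not the BRIDGE estimator, which the abstract describes as a three-step parametric empirical Bayes procedure; it assumes a purely additive (location-only) batch effect and no shrinkage across genes. The function names `estimate_batch_shift` and `remove_batch_shift` and the genes-by-samples list layout are hypothetical choices made for this example.

```python
from statistics import mean


def estimate_batch_shift(bridge_batch1, bridge_batch2):
    """Estimate a per-gene additive batch shift from bridge samples.

    Both arguments are lists of genes; each gene is a list of values, one per
    bridge sample, in the same sample order. Because a bridge sample is the
    same biological material measured in both batches, the mean paired
    difference (batch 2 minus batch 1) reflects batch, not biology.
    """
    shifts = []
    for gene_b1, gene_b2 in zip(bridge_batch1, bridge_batch2):
        paired_diffs = [b2 - b1 for b1, b2 in zip(gene_b1, gene_b2)]
        shifts.append(mean(paired_diffs))
    return shifts


def remove_batch_shift(batch2_data, shifts):
    """Subtract each gene's estimated shift from all batch-2 samples."""
    return [
        [value - shift for value in gene_values]
        for gene_values, shift in zip(batch2_data, shifts)
    ]


# Toy data: 2 genes, 2 bridge samples measured in each batch.
bridge_b1 = [[1.0, 2.0], [3.0, 5.0]]
bridge_b2 = [[1.5, 2.5], [2.0, 4.0]]

# Batch-2 measurements of the non-bridge (study) samples.
study_b2 = [[2.0, 3.0], [1.0, 0.0]]

shifts = estimate_batch_shift(bridge_b1, bridge_b2)
corrected = remove_batch_shift(study_b2, shifts)
print(shifts)
print(corrected)
```

After this adjustment, any remaining difference between a subject's first-batch and corrected second-batch values can be attributed to time rather than batch, under the additive assumption stated above.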
Publisher
Walter de Gruyter GmbH
Subject
Computational Mathematics, Genetics, Molecular Biology, Statistics and Probability
Cited by
4 articles.