Affiliation:
1. Department of Mathematical Sciences, Michigan Technological University, Houghton, MI 49931, USA
Abstract
Genome-wide association studies (GWAS) have successfully revealed many disease-associated genetic variants. In a case-control study, an association test achieves adequate power only with a large sample size, but genotyping large samples is expensive. A cost-effective strategy to boost power is to integrate external control samples whose genotype data are publicly available. However, naive integration of external controls may inflate type I error rates if systematic differences between studies (batch effects) are ignored, such as differences in sequencing platforms, genotype-calling procedures, and population stratification. To account for the batch effect, we propose an approach that integrates External Controls into the Association Test by Regression Calibration (iECAT-RC) in case-control association studies. Extensive simulation studies show that iECAT-RC not only controls type I error rates but also boosts statistical power in all models. We also apply iECAT-RC to the UK Biobank data for M72 Fibroblastic disorders, treating genotype calling as the batch effect. Four SNPs associated with fibroblastic disorders are detected by iECAT-RC and by the two comparison methods, iECAT-Score and Internal. However, our method has a higher probability of identifying these significant SNPs in an unbalanced case-control association study.
Subject
Genetics (clinical), Genetics