Abstract
AbstractAs genomic sequencing becomes more accurate and less costly, large cohorts and consortiums of cohorts are providing high power for rare variant association studies for many conditions. When large sample sizes are not attainable and the phenotype under study is continuous, an extreme phenotypes design can provide high statistical power with a small to moderate sample size. We extend the extreme phenotypes design to the dichotomous infectious disease outcome by sampling on extremes of the pathogenic exposure instead of sampling on extremes of phenotype. We use a likelihood ratio test (LRT) to test the significance of association between infection status and presence of susceptibility rare variants. More than 10 billion simulations are studied to assess the method. The method results in high sample enrichment for rare variants affecting susceptibility. Greater than 90% power to detect rare variant associations is attained in reasonable scenarios. The ordinary case-control design requires orders of magnitude more samples to achieve the same power. The Type I error rate of the LRT is accurate even for p-values < 10-7. We find that erroroneous exposure assessment can lead to power loss more severe than excluding the observations with errors. Nevertheless, careful sampling on exposure extremes can make a study feasible by providing adequate statistical power. Limitations of this method are not unique to this design, and the power is never less than that of the ordinary case-control design. The method applies without modification to other dichotomous outcomes that have strong association with a continuous covariate.
Publisher
Cold Spring Harbor Laboratory
Reference25 articles.
1. Rare coding variants in PLCG2, ABI3, and TREM2 implicate microglial-mediated innate immunity in Alzheimer's disease
2. COVID-19 Host Genetics Initiative. Mapping the human genetic architecture of COVID-19. Nature [Internet]. 2021 Jul 8 [cited 2021 Jul 26]; Available from: http://www.nature.com/articles/s41586-021-03767-x
3. Mapping quantitative trait loci with extreme discordant sib pairs: sampling considerations;Am J Hum Genet,1996
4. Discriminative Ability of LDL-Cholesterol to Identify Patients With Familial Hypercholesterolemia