Affiliation:
1. Department of Mathematics and Statistics, Wright State University, Dayton, OH 45324, USA
Abstract
Association testing has been widely used to study the relationship between phenotypes and genetic variants. Most testing methods are based on genotypes. To avoid genotype calling and directly test on next-generation sequencing (NGS) data, sequencing data-based methods have been proposed and shown advantages over genotype-based testing methods in scenarios where genotype calling is inaccurate. Most sequencing data-based testing methods are based on a single genetic marker. The objective of this paper is to extend the methods to allow testing for the association of a continuous response variable with a group of common variants or a group of rare variants without genotype calling. Our proposed methods are derived based on a standard linear model framework. We derive the joint significant test (JS) for a group of common genetic variables and the variable collapse test (VC) for a group of rare genetic variables. We have conducted extensive simulation studies to evaluate the performance of different estimators. According to our results, we found (1) all methods, including our proposed NGS data-based methods and genotype-based methods, can control the Type I error rate probability well; (2) our proposed NGS data-based methods can achieve better performance in terms of statistical power compared with their corresponding genotype-based methods in the literature; (3) when sequencing depth increases, the performance of all methods increases, and the difference between the performance of NGS data-based methods and corresponding genotype-based methods decreases. In conclusion, we have proposed NGS data-based methods that allow testing for the significance of a group of variants using a linear model framework and have shown the advantage of our NGS data-based methods over genotype-based methods in the literature.
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference51 articles.
1. Men, A.E., Wilson, P., Siemering, K., and Forrest, S. (2008). Next Generation Genome Sequencing: Towards Personalized Medicine, John Wiley & Sons.
2. Illumina_Inc. (2023, January 15). DNA Sequencing with Solexa® Technology. Available online: https://courses.cs.duke.edu/spring21/compsci260/resources/GenomeSequencingTechnology/Illumina.Solexa.sequencing.pdf.
3. Wall, P.K., Leebens-Mack, J., Chanderbali, A.S., Barakat, A., Wolcott, E., Liang, H., Landherr, L., Tomsho, L.P., Hu, Y., and Carlson, J.E. (2009). Comparison of next generation sequencing technologies for transcriptome characterization. BMC Genom., 10.
4. Next-generation sequencing platforms;Mardis;Annu. Rev. Anal. Chem.,2013
5. Next-generation DNA sequencing;Shendure;Nat. Biotechnol.,2008
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献