Affiliation:
1. College of Science, Nanjing Agricultural University, Nanjing 210095, China
Abstract
Genome-wide association studies (GWASs) are flexible and comprehensive tools for identifying single nucleotide polymorphisms (SNPs) associated with complex traits or diseases. The whole-genome Bayesian models are an effective way of incorporating important prior information into modeling. Bayesian methods have been widely used in association analysis. However, Bayesian analysis is often not feasible due to the high-throughput genotype and large sample sizes involved. In this study, we propose a new Bayesian algorithm under the mixed linear model framework: the expectation and maximization BayesB Improved algorithm (emBBI). The emBBI algorithm corrects polygenic and environmental noise and reduces dimensions; then, it estimates and tests marker effects using emBayesB and the LOD test, respectively. We conducted two simulation experiments and analyzed a real dataset related to flowering time in Arabidopsis to demonstrate the validation of the new algorithm. The results show that the emBBI algorithm is more flexible and accurate in simulation studies compared to established methods, and it performs well under complex genetic backgrounds. The analysis of the Arabidopsis real dataset further illustrates the advantages of the emBBI algorithm for GWAS by detecting known genes. Furthermore, 12 candidate genes are identified in the neighborhood of the significant quantitative trait nucleotides (QTNs) of flowering-related QTNs in Arabidopsis. In addition, we also performed enrichment analysis and tissue expression analysis of candidate genes, which will help us better understand the genetic basis of flowering-related traits in Arabidopsis.
Funder
Innovation and Entrepreneurship Program of the Nanjing Agriculture University
Reference45 articles.
1. Genome-wide association studies;Uffelmann;Nat. Rev. Methods Primers,2021
2. 10 Years of GWAS Discovery: Biology, Function, and Translation;Visscher;Am. J. Hum. Genet.,2017
3. A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity;Frayling;Science,2007
4. Diverse Genome-wide Association Studies Associate the IL12/IL23 Pathway with Crohn Disease;Wang;Am. J. Hum. Genet.,2009
5. Ma, J.W., Yang, J., Zhou, L.S., Ren, J., Liu, X.X., Zhang, H., Yang, B., Zhang, Z.Y., Ma, H.B., and Xie, X.H. (2014). A Splice Mutation in the Gene Causes High Glycogen Content and Low Meat Quality in Pig Skeletal Muscle. PLoS Genet., 10.