Abstract
Background
Gene set tests can pinpoint genes and biological pathways that exert small to moderate effects on complex diseases like Type 2 Diabetes (T2D). By aggregating genetic markers based on biological information, these tests can enhance the statistical power needed to detect genetic associations.
Results
Our goal was to develop a gene set test utilizing Bayesian Linear Regression (BLR) models, which account for both linkage disequilibrium (LD) and the complex genetic architectures intrinsic to diseases, thereby increasing the detection power of genetic associations. Through a series of simulation studies, we demonstrated how the efficacy of BLR derived gene set tests is influenced by several factors, including the proportion of causal markers, the size of gene sets, the percentage of genetic variance explained by the gene set, and the genetic architecture of the traits. By using KEGG pathways, eQTLs, and regulatory elements as different kinds of gene sets with T2D results, we also assessed the performance of gene set tests in explaining more about real phenotypes.
Conclusions
Comparing our method with other approaches, such as the gold standard MAGMA (Multi-marker Analysis of Genomic Annotation) approach, our BLR gene set test showed superior performance. Combining performance of our method in simulated and real phenotypes, this suggests that our BLR-based approach could more accurately identify genes and biological pathways underlying complex diseases.