Affiliation:
1. Department of Biostatistics and Epidemiology, School of Health Isfahan University of Medical Sciences Isfahan Iran
2. Cellular and Molecular Endocrine Research Center, Research Institute for Endocrine Sciences Shahid Beheshti University of Medical Sciences Tehran Iran
3. Department of Statistics, Faculty of Mathematics & Statistics University of Isfahan Isfahan Iran
4. Bioinformatics Research Centre, Institute of Advanced Studies, Quantitative Genetics and Genomics Aarhus University Aarhus Denmark
Abstract
AbstractObjectiveThe genome‐wide association studies (GWAS) analysis, the most successful technique for discovering disease‐related genetic variation, has some statistical concerns, including multiple testing, the correlation among variants (single‐nucleotide polymorphisms) based on linkage disequilibrium and omitting the important variants when fitting the model with just one variant. To eliminate these problems in a small sample‐size study, we used a sparse Bayesian learning model for finding bipolar disorder (BD) genetic variants.MethodsThis study used the Wellcome Trust Case Control Consortium data set, including 1998 BD cases and 1500 control samples, and after quality control, 380,628 variants were analysed. In this GWAS, a Bayesian logistic model with hierarchical shrinkage spike and slab priors was used, with all variants considered simultaneously in one model. In order to decrease the computational burden, an alternative inferential method, Bayesian variational inference, has been used.ResultsThirteen variants were selected as associated with BD. The three of them (rs7572953, rs1378850 and rs4148944) were reported in previous GWAS. Eight of which were related to hemogram parameters, such as lymphocyte percentage, plateletcrit and haemoglobin concentration. Among selected related genes, GABPA, ELF3 and JAM2 were enriched in the platelet‐derived growth factor pathway. These three genes, along with APP, ARL8A, CDH23 and GPR37L1, could be differential diagnostic variants for BD.ConclusionsBy reducing the statistical restrictions of GWAS analysis, the application of the Bayesian variational spike and slab models can offer insight into the genetic link with BD even with a small sample size. To uncover related variations with other traits, this model needs to be further examined.
Reference44 articles.
1. Genetic Mapping in Human Disease
2. Bipolar disorder
3. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
4. Carbonetto P. Zhou X. &Stephens M.(2017).varbvs: Fast variable selection for large‐scale regression. Journal of Statistical Software arXiv preprint arXiv:1709.06597.