A Bayesian method for estimating gene‐level polygenicity under the framework of transcriptome‐wide association study-Reference-Cited by-同舟云学术

A Bayesian method for estimating gene‐level polygenicity under the framework of transcriptome‐wide association study

Published:2023-08-29 Issue:26 Volume:42 Page:4867-4885
ISSN:0277-6715
Container-title:Statistics in Medicine
language:en
Short-container-title:Statistics in Medicine

Author:

Majumdar Arunabha¹^ORCID,Pasaniuc Bogdan²

Affiliation:

1. Department of Mathematics Indian Institute of Technology Hyderabad Kandi Telangana India

2. Department of Pathology and Laboratory Medicine University of California, Los Angeles Los Angeles California

Abstract

Polygenicity refers to the phenomenon that multiple genetic variants have a nonzero effect on a complex trait. It is defined as the proportion of genetic variants with a nonzero effect on the trait. Evaluation of polygenicity can provide valuable insights into the genetic architecture of the trait. Several recent works have attempted to estimate polygenicity at the single nucleotide polymorphism level. However, evaluating polygenicity at the gene level can be biologically more meaningful. We propose the notion of gene‐level polygenicity, defined as the proportion of genes having a nonzero effect on the trait under the framework of a transcriptome‐wide association study. We introduce a Bayesian approach genepoly to estimate this quantity for a trait. The method is based on spike and slab prior and simultaneously estimates the subset of non‐null genes. Our simulation study shows that genepoly efficiently estimates gene‐level polygenicity. The method produces a downward bias for small choices of trait heritability due to a non‐null gene, which diminishes rapidly with an increase in the genome‐wide association study (GWAS) sample size. While identifying the subset of non‐null genes, genepoly offers a high level of specificity and an overall good level of sensitivity—the sensitivity increases as the sample size of the reference panel expression and GWAS data increase. We applied the method to seven phenotypes in the UK Biobank, integrating expression data. We find height to be the most polygenic and asthma to be the least polygenic.

Publisher

Wiley

Subject

Statistics and Probability,Epidemiology

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/sim.9892

Reference30 articles.

1. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits

2. Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies

3. Extreme Polygenicity of Complex Traits Is Explained by Negative Selection

4. Estimation of regional polygenicity from GWAS provides insights into the genetic architecture of complex traits

5. Opportunities and challenges for transcriptome-wide association studies