Abstract
ABSTRACTGenome-wide association studies (GWAS) identify genomic loci associated with complex traits, but it remains an open challenge to identify the genes underlying the association signals. Here, we extend the equations of statistical fine-mapping, to compute the probability that each gene in the human genome is targeted by a causal variant, given a particular trait. Our computations are enabled by several key innovations. First, we partition the genome into optimal linkage disequilibrium blocks, enabling genome-wide detection of trait-associated genes. Second, we unveil a comprehensive mapping that associates genetic variants to the target genes they affect. The combined performance of the map on high-throughput functional genomics and eQTL datasets supersedes the state of the art. Lastly, we describe an algorithm which learns, directly from GWAS data, how to incorporate prior knowledge into the statistical computations, significantly improving their accuracy. We validate each component of the statistical framework individually and in combination. Among methods to identify genes targeted by causal variants, this paradigm rediscovers an unprecedented proportion of known disease genes. Moreover, it establishes human genetics support for many genes previously implicated only by clinical or preclinical evidence, and it discovers an abundance of novel disease genes with compelling biological rationale.
Publisher
Cold Spring Harbor Laboratory
Reference172 articles.
1. Akbari, P. , Gilani, A. , Sosina, O. , Kosmicki, J. A. , Khrimian, L. , Feng, Y.-Y. , Persaud, T. , Garcia, V. , Sun, D. , Li, A. et al. (2021). Sequencing of 640,000 exomes identifies GPR75 variants associated with protection from obesity. Science 373.
2. Genome-Wide Association Study of Susceptibility to Idiopathic Pulmonary Fibrosis;Am. J. Respir. Crit. Care Med,2020
3. An atlas of active enhancers across human cell types and tissues;Nature,2014
4. Auton, A. (2013). Genetic maps generated from the 1000G Phase 1 phased OMNI data. ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/working/20130507_omni_recombination_rates/.
5. Barbeira, A. N. , Dickinson, S. P. , Bonazzola, R. , Zheng, J. , Wheeler, H. E. , Torres, J. M. , Torstenson, E. S. , Shah, K. P. , Garcia, T. , Edwards, T. L. et al. (2018). Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9.