Modeling islet enhancers using deep learning identifies candidate causal variants at loci associated with T2D and glycemic traits

Author:

Hudaiberdiev Sanjarbek1,Taylor D. Leland2ORCID,Song Wei1,Narisu Narisu2,Bhuiyan Redwan M.34,Taylor Henry J.25ORCID,Tang Xuming67,Yan Tingfen2,Swift Amy J.2,Bonnycastle Lori L.2,Consortium DIAMANTE,Chen Shuibing67ORCID,Stitzel Michael L.348,Erdos Michael R.2,Ovcharenko Ivan1,Collins Francis S.2ORCID

Affiliation:

1. Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20892

2. Center for Precision Health Research, National Human Genome Research Institute, NIH, Bethesda, MD 20892

3. The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032

4. Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT 06032

5. British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK

6. Department of Surgery, Weill Cornell Medicine, New York, NY 10065

7. Center for Genomic Health, Weill Cornell Medicine, New York, NY 10065

8. Institute of Systems Genomics, University of Connecticut, Farmington, CT 06032

Abstract

Genetic association studies have identified hundreds of independent signals associated with type 2 diabetes (T2D) and related traits. Despite these successes, the identification of specific causal variants underlying a genetic association signal remains challenging. In this study, we describe a deep learning (DL) method to analyze the impact of sequence variants on enhancers. Focusing on pancreatic islets, a T2D relevant tissue, we show that our model learns islet-specific transcription factor (TF) regulatory patterns and can be used to prioritize candidate causal variants. At 101 genetic signals associated with T2D and related glycemic traits where multiple variants occur in linkage disequilibrium, our method nominates a single causal variant for each association signal, including three variants previously shown to alter reporter activity in islet-relevant cell types. For another signal associated with blood glucose levels, we biochemically test all candidate causal variants from statistical fine-mapping using a pancreatic islet beta cell line and show biochemical evidence of allelic effects on TF binding for the model-prioritized variant. To aid in future research, we publicly distribute our model and islet enhancer perturbation scores across ~67 million genetic variants. We anticipate that DL methods like the one presented in this study will enhance the prioritization of candidate causal variants for functional studies.

Funder

HHS | National Institutes of Health

U.S. Department of Defense

HHS | NIH | National Institute of Diabetes and Digestive and Kidney Diseases

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3