Relating enhancer genetic variation across mammals to complex phenotypes using machine learning

Author:

Kaplow Irene M.12ORCID,Lawler Alyssa J.23ORCID,Schäffer Daniel E.1ORCID,Srinivasan Chaitanya1ORCID,Sestili Heather H.1ORCID,Wirthlin Morgan E.12ORCID,Phan BaDoi N.124ORCID,Prasad Kavya1ORCID,Brown Ashley R.1ORCID,Zhang Xiaomeng1ORCID,Foley Kathleen5ORCID,Genereux Diane P.67ORCID,Karlsson Elinor K.67ORCID,Lindblad-Toh Kerstin68ORCID,Meyer Wynn K.5ORCID,Pfenning Andreas R.123ORCID,Andrews Gregory,Armstrong Joel C.,Bianchi Matteo,Birren Bruce W.,Bredemeyer Kevin R.,Breit Ana M.,Christmas Matthew J.,Clawson Hiram,Damas Joana,Di Palma Federica,Diekhans Mark,Dong Michael X.,Eizirik Eduardo,Fan Kaili,Fanter Cornelia,Foley Nicole M.,Forsberg-Nilsson Karin,Garcia Carlos J.,Gatesy John,Gazal Steven,Genereux Diane P.,Goodman Linda,Grimshaw Jenna,Halsey Michaela K.,Harris Andrew J.,Hickey Glenn,Hiller Michael,Hindle Allyson G.,Hubley Robert M.,Hughes Graham M.,Johnson Jeremy,Juan David,Kaplow Irene M.,Karlsson Elinor K.,Keough Kathleen C.,Kirilenko Bogdan,Koepfli Klaus-Peter,Korstian Jennifer M.,Kowalczyk Amanda,Kozyrev Sergey V.,Lawler Alyssa J.,Lawless Colleen,Lehmann Thomas,Levesque Danielle L.,Lewin Harris A.,Li Xue,Lind Abigail,Lindblad-Toh Kerstin,Mackay-Smith Ava,Marinescu Voichita D.,Marques-Bonet Tomas,Mason Victor C.,Meadows Jennifer R. S.,Meyer Wynn K.,Moore Jill E.,Moreira Lucas R.,Moreno-Santillan Diana D.,Morrill Kathleen M.,Muntané Gerard,Murphy William J.,Navarro Arcadi,Nweeia Martin,Ortmann Sylvia,Osmanski Austin,Paten Benedict,Paulat Nicole S.,Pfenning Andreas R.,Phan BaDoi N.,Pollard Katherine S.,Pratt Henry E.,Ray David A.,Reilly Steven K.,Rosen Jeb R.,Ruf Irina,Ryan Louise,Ryder Oliver A.,Sabeti Pardis C.,Schäffer Daniel E.,Serres Aitor,Shapiro Beth,Smit Arian F. A.,Springer Mark,Srinivasan Chaitanya,Steiner Cynthia,Storer Jessica M.,Sullivan Kevin A. M.,Sullivan Patrick F.,Sundström Elisabeth,Supple Megan A.,Swofford Ross,Talbot Joy-El,Teeling Emma,Turner-Maier Jason,Valenzuela Alejandro,Wagner Franziska,Wallerman Ola,Wang Chao,Wang Juehan,Weng Zhiping,Wilder Aryn P.,Wirthlin Morgan E.,Xue James R.,Zhang Xiaomeng,

Affiliation:

1. Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA, USA.

2. Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA.

3. Department of Biology, Carnegie Mellon University, Pittsburgh, PA, USA.

4. Medical Scientist Training Program, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA.

5. Department of Biological Sciences, Lehigh University, Bethlehem, PA, USA.

6. Broad Institute, Cambridge, MA, USA.

7. Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA.

8. Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.

Abstract

Protein-coding differences between species often fail to explain phenotypic diversity, suggesting the involvement of genomic elements that regulate gene expression such as enhancers. Identifying associations between enhancers and phenotypes is challenging because enhancer activity can be tissue-dependent and functionally conserved despite low sequence conservation. We developed the Tissue-Aware Conservation Inference Toolkit (TACIT) to associate candidate enhancers with species’ phenotypes using predictions from machine learning models trained on specific tissues. Applying TACIT to associate motor cortex and parvalbumin-positive interneuron enhancers with neurological phenotypes revealed dozens of enhancer–phenotype associations, including brain size–associated enhancers that interact with genes implicated in microcephaly or macrocephaly. TACIT provides a foundation for identifying enhancers associated with the evolution of any convergently evolved phenotype in any large group of species with aligned genomes.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

Multidisciplinary

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3