Integrating gene annotation with orthology inference at scale

Author:

Kirilenko Bogdan M.12345ORCID,Munegowda Chetan12345ORCID,Osipova Ekaterina12345ORCID,Jebb David123,Sharma Virag123ORCID,Blumer Moritz123ORCID,Morales Ariadna E.456ORCID,Ahmed Alexis-Walid456,Kontopoulos Dimitrios-Georgios456ORCID,Hilgers Leon456ORCID,Lindblad-Toh Kerstin78ORCID,Karlsson Elinor K.8910ORCID,Hiller Michael12345ORCID,Andrews Gregory,Armstrong Joel C.,Bianchi Matteo,Birren Bruce W.,Bredemeyer Kevin R.,Breit Ana M.,Christmas Matthew J.,Clawson Hiram,Damas Joana,Di Palma Federica,Diekhans Mark,Dong Michael X.,Eizirik Eduardo,Fan Kaili,Fanter Cornelia,Foley Nicole M.,Forsberg-Nilsson Karin,Garcia Carlos J.,Gatesy John,Gazal Steven,Genereux Diane P.,Goodman Linda,Grimshaw Jenna,Halsey Michaela K.,Harris Andrew J.,Hickey Glenn,Hiller Michael,Hindle Allyson G.,Hubley Robert M.,Hughes Graham M.,Johnson Jeremy,Juan David,Kaplow Irene M.,Karlsson Elinor K.,Keough Kathleen C.,Kirilenko Bogdan,Koepfli Klaus-Peter,Korstian Jennifer M.,Kowalczyk Amanda,Kozyrev Sergey V.,Lawler Alyssa J.,Lawless Colleen,Lehmann Thomas,Levesque Danielle L.,Lewin Harris A.,Li Xue,Lind Abigail,Lindblad-Toh Kerstin,Mackay-Smith Ava,Marinescu Voichita D.,Marques-Bonet Tomas,Mason Victor C.,Meadows Jennifer R. S.,Meyer Wynn K.,Moore Jill E.,Moreira Lucas R.,Moreno-Santillan Diana D.,Morrill Kathleen M.,Muntané Gerard,Murphy William J.,Navarro Arcadi,Nweeia Martin,Ortmann Sylvia,Osmanski Austin,Paten Benedict,Paulat Nicole S.,Pfenning Andreas R.,Phan BaDoi N.,Pollard Katherine S.,Pratt Henry E.,Ray David A.,Reilly Steven K.,Rosen Jeb R.,Ruf Irina,Ryan Louise,Ryder Oliver A.,Sabeti Pardis C.,Schäffer Daniel E.,Serres Aitor,Shapiro Beth,Smit Arian F. A.,Springer Mark,Srinivasan Chaitanya,Steiner Cynthia,Storer Jessica M.,Sullivan Kevin A. M.,Sullivan Patrick F.,Sundström Elisabeth,Supple Megan A.,Swofford Ross,Talbot Joy-El,Teeling Emma,Turner-Maier Jason,Valenzuela Alejandro,Wagner Franziska,Wallerman Ola,Wang Chao,Wang Juehan,Weng Zhiping,Wilder Aryn P.,Wirthlin Morgan E.,Xue James R.,Zhang Xiaomeng,

Affiliation:

1. Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany.

2. Max Planck Institute for the Physics of Complex Systems, 01187 Dresden, Germany.

3. Center for Systems Biology Dresden, 01307 Dresden, Germany.

4. LOEWE Centre for Translational Biodiversity Genomics, 60325 Frankfurt, Germany.

5. Senckenberg Research Institute, 60325 Frankfurt, Germany.

6. Goethe University Frankfurt, Faculty of Biosciences, 60438 Frankfurt, Germany.

7. Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 751 32 Uppsala, Sweden.

8. Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA.

9. Program in Bioinformatics and Integrative Biology, UMass Chan Medical School, Worcester, MA 01605, USA.

10. Program in Molecular Medicine, UMass Chan Medical School, Worcester, MA 01605, USA.

Abstract

Annotating coding genes and inferring orthologs are two classical challenges in genomics and evolutionary biology that have traditionally been approached separately, limiting scalability. We present TOGA (Tool to infer Orthologs from Genome Alignments), a method that integrates structural gene annotation and orthology inference. TOGA implements a different paradigm to infer orthologous loci, improves ortholog detection and annotation of conserved genes compared with state-of-the-art methods, and handles even highly fragmented assemblies. TOGA scales to hundreds of genomes, which we demonstrate by applying it to 488 placental mammal and 501 bird assemblies, creating the largest comparative gene resources so far. Additionally, TOGA detects gene losses, enables selection screens, and automatically provides a superior measure of mammalian genome quality. TOGA is a powerful and scalable method to annotate and compare genes in the genomic era.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

Multidisciplinary

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3