SpeciesRax: A tool for maximum likelihood species tree inference from gene family trees under duplication, transfer, and loss

Author:

Morel BenoitORCID,Schade Paul,Lutteropp Sarah,Williams Tom A.ORCID,Szöllősi Gergely J.ORCID,Stamatakis AlexandrosORCID

Abstract

AbstractSpecies tree inference from gene family trees is becoming increasingly popular because it can account for discordance between the species tree and the corresponding gene family trees. In particular, methods that can account for multiple-copy gene families exhibit potential to leverage paralogy as informative signal. At present, there does not exist any widely adopted inference method for this purpose. Here, we present SpeciesRax, the first maximum likelihood method that can infer a rooted species tree from a set of gene family trees and can account for gene duplication, loss, and transfer events. By explicitly modelling events by which gene trees can depart from the species tree, SpeciesRax leverages the phylogenetic rooting signal in gene trees. SpeciesRax infers species tree branch lengths in units of expected substitutions per site and branch support values via paralogy-aware quartets extracted from the gene family trees. Using both empirical and simulated datasets we show that SpeciesRax is at least as accurate as the best competing methods while being one order of magnitude faster on large datasets at the same time. We used SpeciesRax to infer a biologically plausible rooted phylogeny of the vertebrates comprising 188 species from 31612 gene families in one hour using 40 cores. SpeciesRax is available under GNU GPL at https://github.com/BenoitMorel/GeneRax and on BioConda.

Publisher

Cold Spring Harbor Laboratory

Reference79 articles.

1. ExaBayes: Massively Parallel Bayesian Tree Inference for the Whole-Genome Era

2. Altenhoff, A.M. , Glover, N.M. , and Dessimoz, C. 2019. Inferring Orthology and Paralogy, pages 149–175. Springer New York, New York, NY.

3. The Amborella Genome and the Evolution of Flowering Plants

4. Bayzid, M. , Mirarab, S. , and Warnow, T. 2013. Inferring optimal species trees under gene duplication and loss. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, pages 250–261. 18th Pacific Symposium on Biocomputing, PSB 2013; Conference date: 03-01-2013 Through 07-01-2013.

5. Betancur-R, R. , Broughton, R.E. , Wiley, E.O. , Carpenter, K. , Lopez, J.A. , Li, C. , Holcroft, N.I. , Arcila, D. , Sanciangco, M. , Cureton Ii , J.C., et al. 2013. The tree of life and a new classification of bony fishes. PLoS currents, 5.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3