Disjoint Tree Mergers for Large-Scale Maximum Likelihood Tree Estimation-Reference-Cited by-同舟云学术

Disjoint Tree Mergers for Large-Scale Maximum Likelihood Tree Estimation

Published:2021-05-07 Issue:5 Volume:14 Page:148
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Park Minhyuk^ORCID,Zaharias Paul^ORCID,Warnow Tandy^ORCID

Abstract

The estimation of phylogenetic trees for individual genes or multi-locus datasets is a basic part of considerable biological research. In order to enable large trees to be computed, Disjoint Tree Mergers (DTMs) have been developed; these methods operate by dividing the input sequence dataset into disjoint sets, constructing trees on each subset, and then combining the subset trees (using auxiliary information) into a tree on the full dataset. DTMs have been used to advantage for multi-locus species tree estimation, enabling highly accurate species trees at reduced computational effort, compared to leading species tree estimation methods. Here, we evaluate the feasibility of using DTMs to improve the scalability of maximum likelihood (ML) gene tree estimation to large numbers of input sequences. Our study shows distinct differences between the three selected ML codes—RAxML-NG, IQ-TREE 2, and FastTree 2—and shows that good DTM pipeline design can provide advantages over these ML codes on large datasets.

Funder

National Science Foundation

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/14/5/148/pdf

Reference37 articles.

1. Some probabilistic and statistical problems in the analysis of DNA sequences;Tavaré;Lect. Math. Life Sci.,1986

2. A Short Proof that Phylogenetic Tree Reconstruction by Maximum Likelihood Is Hard

3. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies