Abstract
AbstractPhylogenetic trees are fundamental for understanding evolutionary history. However, finding maximum likelihood trees is challenging due to the complexity of the likelihood landscape and the size of tree space. Based on the Billera-Holmes-Vogtmann (BHV) distance between trees, we describe a method to generate intermediate trees on the shortest path between two trees, called pathtrees. These pathtrees give a structured way to generate and visualize treespace in an area of interest. They allow investigating intermediate regions between trees of interest, exploring locally optimal trees in topological clusters of treespace, and potentially finding trees of high likelihood unexplored by tree search algorithms. We compared our approach against other tree search tools (Paup*, RAxML, and RevBayes) in terms of generated highest likelihood trees, new topology proportions, and consistency of generated treespace. We assess our method using two datasets. The first consists of 23 primate species (CytB, 1141 bp), leading to well-resolved relationships. The second is a dataset of 182 milksnakes (CytB, 1117 bp), containing many similar sequences and complex relationships among individuals. Our method visualizes the treespace using log likelihood as a fitness function. It finds similarly optimal trees as heuristic methods and presents the likelihood landscape at different scales. It revealed that we could find trees that were not found with MCMC methods. The validation measures indicated that our method performed well mapping treespace into lower dimensions. Our method complements heuristic search analyses, and the visualization allows the inspection of likelihood terraces and exploration of treespace areas not visited by heuristic searches.
Publisher
Cold Spring Harbor Laboratory
Reference59 articles.
1. N. Amenta and J. Klingner . Case study: Visualizing sets of evolutionary trees. In IEEE Symposium on Information Visualization, 2002. INFOVIS 2002., pages 71–74. IEEE, 2002.
2. Approximating geodesic tree distance;Inform Process Lett,2007
3. N. Amenta , M. Datar , A. Dirksen , M. d. Bruijne , A. Feragen , X. Ge , J. H. Pedersen , M. Howard , M. Owen , J. Petersen , et al. Quantification and visualization of variation in anatomical trees. In Research in Shape Modeling, pages 57–79. Springer, 2015.
4. Landscapes on spaces of trees
5. Geometry of the Space of Phylogenetic Trees