Affiliation:
1. Department of Computer Science, University of Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
Abstract
Comparing phylogenetic trees is a prominent problem widely used in applications such as clustering and building the Tree of Life. While there are many well-developed distance measures for phylogenetic trees defined on the same set of taxa, the situation is contrasting for trees defined on different but mutually overlapping sets of taxa. This paper presents a new polynomial-time algorithm for completing phylogenetic trees and computing the distance between trees defined on different but overlapping sets of taxa. This novel approach considers both the branch lengths and the topology of the phylogenetic trees being compared. We demonstrate that the distance measure applied to completed trees is a metric and provide several properties of the new method, including its symmetrical nature in tree completion.
Funder
Natural Sciences and Engineering Research Council of Canada—Discovery Grants
Canada Graduate Scholarship-Doctoral
Fonds de recherche du Québec-Nature and technologies
University of Sherbrooke grant
Reference40 articles.
1. Comparison of phylogenetic trees;Robinson;Math. Biosci.,1981
2. Briand, S., Dessimoz, C., El-Mabrouk, N., Lafond, M., and Lobinska, G. (2020). A generalized Robinson-Foulds distance for labeled trees. BMC Genom., 21.
3. Information theoretic generalized Robinson–Foulds metrics for comparing phylogenetic trees;Smith;Bioinformatics,2020
4. The Generalized Robinson-Foulds Distance for Phylogenetic Trees;Valiente;J. Comput. Biol.,2021
5. The triples distance for rooted bifurcating phylogenetic trees;Critchlow;Syst. Biol.,1996