Affiliation:
1. Département d’Informatique et de Recherche Opérationnelle, Université de Montréal, 2920 Chemin de la Tour, Montréal, QC H3T 1J4, Canada
Abstract
We present Synesth, the most comprehensive and flexible tool for tree reconciliation that allows for events on syntenies (i.e., on sets of multiple genes), including duplications, transfers, fissions, and transient events going through unsampled species. This model allows for building histories that explicate the inconsistencies between a synteny tree and its associated species tree. We examine the combinatorial properties of this extended reconciliation model and study various associated parsimony problems. First, the infinite set of explicatory histories is reduced to a finite but exponential set of Pareto-optimal histories (in terms of counts of each event type), then to a polynomial set of Pareto-optimal event count vectors, and this eventually ends with minimum event cost histories given an event cost function. An inductive characterization of the solution space using different algebras for each granularity leads to efficient dynamic programming algorithms, ultimately ending with an O(mn) time complexity algorithm for computing the cost of a minimum-cost history (m and n: number of nodes in the input synteny and species trees). This time complexity matches that of the fastest known algorithms for classical gene reconciliation with transfers. We show how Synesth can be applied to infer Pareto-optimal evolutionary scenarios for CRISPR-Cas systems in a set of bacterial genomes.
Funder
Natural Sciences and Engineering Research Council of Canada
Fonds de recherche du Québec—Nature et technologies
Reference22 articles.
1. Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences;Goodman;Syst. Biol.,1979
2. Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss;Bansal;Bioinformatics,2012
3. Donati, B., Baudet, C., Sinaimeri, B., Crescenzi, P., and Sagot, M.F. (2015). EUCALYPT: Efficient tree reconciliation enumerator. Algorithms Mol. Biol., 10.
4. Simultaneous identification of duplications and lateral gene transfers;Tofigh;IEEE/ACM Trans. Comput. Biol. Bioinform.,2011
5. El-Mabrouk, N., and Noutahi, E. (2019). Bioinformatics and Phylogenetics, Springer International Publishing.