Affiliation:
1. Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative—UMR 7238, Paris, France, Paris, France
2. Institut Universitaire de France, Paris, France
Abstract
Abstract
Gene order can be used as an informative character to reconstruct phylogenetic relationships between species independently from the local information present in gene/protein sequences. PhyChro is a reconstruction method based on chromosomal rearrangements, applicable to a wide range of eukaryotic genomes with different gene contents and levels of synteny conservation. For each synteny breakpoint issued from pairwise genome comparisons, the algorithm defines two disjoint sets of genomes, named partial splits, respectively, supporting the two block adjacencies defining the breakpoint. Considering all partial splits issued from all pairwise comparisons, a distance between two genomes is computed from the number of partial splits separating them. Tree reconstruction is achieved through a bottom-up approach by iteratively grouping sister genomes minimizing genome distances. PhyChro estimates branch lengths based on the number of synteny breakpoints and provides confidence scores for the branches. PhyChro performance is evaluated on two data sets of 13 vertebrates and 21 yeast genomes by using up to 130,000 and 179,000 breakpoints, respectively, a scale of genomic markers that has been out of reach until now. PhyChro reconstructs very accurate tree topologies even at known problematic branching positions. Its robustness has been benchmarked for different synteny block reconstruction methods. On simulated data PhyChro reconstructs phylogenies perfectly in almost all cases, and shows the highest accuracy compared with other existing tools. PhyChro is very fast, reconstructing the vertebrate and yeast phylogenies in <15 min.
Funder
Agence Nationale de la Recherche
Institut Universitaire de France
Publisher
Oxford University Press (OUP)
Subject
Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics
Reference70 articles.
1. A canonical decomposition theory for metrics on a finite set;Bandelt;Adv Math,1992
2. Phylogenetic reconstruction and lateral gene transfer;Bapteste;Trends Microbiol,2004
3. The use of genome-level characters for phylogenetic reconstruction;Boore;Trends Ecol Evol,2006
4. Genome-scale evolution: reconstructing gene orders in the ancestral species;Bourque;Genome Res,2002
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献