Affiliation:
1. Center
for Microbial Ecology, Michigan State University, East Lansing,
Michigan
Abstract
ABSTRACT
Phylogenetic
sequence analysis of single or multiple genes has dominated the study
and census of the genetic diversity among closely related bacteria. It
remains unclear, however, how the results based on a few genes in the
genome correlate with whole-genome-based relatedness and what genes (if
any) best reflect whole-genome-level relatedness and hence should be
preferentially used to economize on cost and to improve accuracy. We
show here that phylogenies of closely related organisms based on the
average nucleotide identity (ANI) of their shared genes correspond
accurately to phylogenies based on state-of-the-art analysis of their
whole-genome sequences. We use ANI to evaluate the phylogenetic
robustness of every gene in the genome and show that almost all core
genes, regardless of their functions and positions in the genome, offer
robust phylogenetic reconstruction among strains that show 80 to 95%
ANI (16S rRNA identity, >98.5%). Lack of elapsed time and, to a
lesser extent, horizontal transfer and recombination make the selection
of genes more critical for applications that target the intraspecies
level, i.e., strains that show >95% ANI according to current
standards. A much more accurate phylogeny for the
Escherichia
coli
group was obtained based on just three best-performing genes
according to our analysis compared to the concatenated alignment of
eight genes that are commonly employed for phylogenetic purposes in
this group. Our results are reproducible within the
Salmonella
,
Burkholderia
, and
Shewanella
groups and therefore are expected to have general applicability for
microevolution studies, including metagenomic
surveys.
Publisher
American Society for Microbiology
Subject
Ecology,Applied Microbiology and Biotechnology,Food Science,Biotechnology
Cited by
173 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献