Fast-evolving alignment sites are highly informative for reconstructions of deep Tree of Life phylogenies

Author:

Thibério Rangel L.,Fournier Gregory P.

Abstract

AbstractThe trimming of fast-evolving sites, often known as “slow-fast” analysis, is broadly used in microbial phylogenetic reconstruction under assumption that fast-evolving sites do not retain accurate phylogenetic signal due to substitution saturation. Therefore, removing sites that have experienced multiple substitutions would improve the signal-to-noise ratio in phylogenetic analyses, with the remaining slower-evolving sites preserving a more reliable record of evolutionary relationships. Here we show that, contrary to this assumption, even the fastest evolving sites, present in conserved proteins often used in Tree of Life studies, contain reliable and valuable phylogenetic information, and that the trimming of such sites can negatively impact the accuracy of phylogenetic reconstruction. Simulated alignments modeled after ribosomal protein datasets used in Tree of Life studies consistently show that slow-evolving sites are less likely to recover true bipartitions than even the fastest-evolving sites. Furthermore, site specific substitution-rates are positively correlated with the frequency of accurately recovered short-branched bipartitions, as slowly evolving sites are less likely to have experienced substitutions along these intervals. Using published Tree of Life sequence alignment datasets, we additionally show that both slow-and fast-evolving sites contain similarly inconsistent phylogenetic signals, and that, for fast-evolving sites, this inconsistency can be attributed to poor alignment quality. Furthermore, trimming fast sites, slow sites, or both is shown to have substantial impact on phylogenetic reconstruction across multiple evolutionary models. This is perhaps most evident in the resulting placements of Eukarya and Asgardarchaeota groups, which are especially sensitive to the implementation of different trimming schemes.Significance StatementIt is common practice among comprehensive microbial phylogenetic studies to trim fast-evolving sites from the source alignment in the expectation to increase the signal to noise ratio. Here we show that despite fast-evolving sites being more sensitive to parameter misspecifications than mid-rate evolving sites, such sensitivity is comparable, if not smaller, than what we observe among slow-evolving sites. Through the use of both empirical and simulated datasets we also show that, besides the lack of evidences regarding the noisy nature of fast-evolving sites, such sites are of core importance for the reliable the reconstruction of short-branched bipartitions. Such points are exemplified by the variations in the Eukarya+Archaea Tree of Life when subjective alignment trimming strategies are employed.

Publisher

Cold Spring Harbor Laboratory

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3