Read2Tree: scalable and accurate phylogenetic trees from raw reads

Published: 2022-04-19

Authors:

Dylus David, Altenhoff Adrian, Majidian Sina, Sedlazeck Fritz J, Dessimoz Christophe

Abstract

The inference of phylogenetic trees is foundational to biology. However, state-of-the-art phylogenomics requires running complex pipelines, at significant computational and labour costs, with additional constraints in sequencing coverage, assembly and annotation quality. To overcome these challenges, we present Read2Tree, which directly processes raw sequencing reads into groups of corresponding genes. In a benchmark encompassing a broad variety of datasets, our assembly-free approach was 10-100x faster than conventional approaches and in most cases more accurate; the exception was when sequencing coverage was high and reference species very distant. To illustrate the broad applicability of the tool, we reconstructed a yeast tree of life of 435 species spanning 590 million years of evolution. Applied to Coronaviridae samples, Read2Tree accurately classified highly diverse animal samples and near-identical SARS-CoV-2 sequences on a single tree, thereby exhibiting remarkable breadth and depth. The speed, accuracy, and versatility of Read2Tree enable comparative genomics at scale.

Publisher

Cold Spring Harbor Laboratory
