BATCH-SCAMPP: Scaling phylogenetic placement methods to place many sequences-Reference-Cited by-同舟云学术

BATCH-SCAMPP: Scaling phylogenetic placement methods to place many sequences

Published:2022-10-27 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Wedell Eleanor^ORCID,Shen Chengze^ORCID,Warnow Tandy^ORCID

Abstract

AbstractPhylogenetic placement, the problem of placing sequences into phylogenetic trees, has been limited either by the number of sequences placed in a single run or by the size of the placement tree. The most accurate scalable phylogenetic placement method with respect to the number of query sequences placed, EPA-ng (Barbera et al., 2019), has a runtime that scales sub-linearly to the number of query sequences. However, larger phylogenetic trees cause an increase in EPA-ng’s memory usage, limiting the method to placement trees of up to 10,000 sequences. Our recently designed SCAMPP (Wedell et al., 2021) framework has been shown to scale EPA-ng to larger placement trees of up to 200,000 sequences by building a subtree for the placement of each query sequence. The approach of SCAMPP does not take advantage of EPA-ng’s parallel efficiency since it only places a single query for each run of EPA-ng. Here we present BATCH-SCAMPP, a new technique that overcomes this barrier and enables EPA-ng and other phylogenetic placement methods to scale to ultra-large backbone trees and many query sequences. BATCH-SCAMPP is freely available athttps://github.com/ewedell/BSCAMPP_code.

Publisher

Cold Spring Harbor Laboratory

Reference28 articles.

1. Fast and accurate distance-based phylogenetic placement using divide and conquer;Molecular Ecology Resources,2022

2. Fast and Accurate Distance-based Phylogenetic Placement using Divide and Conquer

3. Metin Balaban , Shahab Sarmashghi , and Siavash Mirarab . APPLES: distance-based phylogenetic placement for scalable and assembly-free sample identification. bioRxiv, page 475566, 2019.

4. APPLES: Scalable Distance-Based Phylogenetic Placement with or without Alignments

5. EPA-ng: massively parallel evolutionary placement of genetic sequences;Systematic Biology,2019

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Scaling DEPP phylogenetic placement to ultra-large reference trees: a tree-aware ensemble approach;Bioinformatics;2024-06

2. Scaling deep phylogenetic embedding to ultra-large reference trees: a tree-aware ensemble approach;2023-03-29