SPUMONI 2: improved classification using a pangenome index of minimizer digests

Published: 2023-05-18 Issue: 1 Volume: 24 Page:
ISSN: 1474-760X
Container-title: Genome Biology
Language: en
Short-container-title: Genome Biol

Author:

Ahmed Omar Y., Rossi Massimiliano, Gagie Travis, Boucher Christina, Langmead Ben

Abstract

Genomics analyses use large reference sequence collections, like pangenomes or taxonomic databases. SPUMONI 2 is an efficient tool for sequence classification of both short and long reads. It performs multi-class classification using a novel sampled document array. By incorporating minimizers, SPUMONI 2’s index is 65 times smaller than minimap2’s for a mock community pangenome. SPUMONI 2 achieves a speed improvement of 3-fold compared to SPUMONI and 15-fold compared to minimap2. We show SPUMONI 2 achieves an advantageous mix of accuracy and efficiency in practical scenarios such as adaptive sampling, contamination detection and multi-class metagenomics classification.

Funder

Office of Advanced Cyberinfrastructure

National Human Genome Research Institute

Division of Biological Infrastructure

Natural Sciences and Engineering Research Council of Canada

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s13059-023-02958-1.pdf

References (42 articles)

1. Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20(1):1–13.

2. Kim D, Song L, Breitwieser FP, Salzberg SL. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 2016;26(12):1721–9.