Benchmarking Bioinformatic Virus Identification Tools Using Real-World Metagenomic Data across Biomes-Reference-Cited by-同舟云学术

Benchmarking Bioinformatic Virus Identification Tools Using Real-World Metagenomic Data across Biomes

Published:2023-04-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Wu Ling-Yi^ORCID,Pappas Nikolaos,Wijesekara Yasas,Piedade Gonçalo J.^ORCID,Brussaard Corina P.D.^ORCID,Dutilh Bas E.^ORCID

Abstract

ABSTRACTAs most viruses remain uncultivated, metagenomics is currently the main method for virus discovery. Detecting viruses in metagenomic data is not trivial. In the past few years, many bioinformatic virus identification tools have been developed for this task, making it challenging to choose the right tools, parameters, and cutoffs. As all these tools measure different biological signals, and use different algorithms and training/reference databases, it is imperative to conduct an independent benchmarking to give users objective guidance. We compared the performance of ten state-of-the-art virus identification tools in thirteen modes on eight paired viral and microbial datasets from three distinct biomes, including a new complex dataset from Antarctic coastal waters. The tools had highly variable true positive rates (0 – 68%) and false positive rates (0 – 15%). PPR-Meta best distinguished viral from microbial contigs, followed by DeepVirFinder, VirSorter2, and VIBRANT. Different tools identified different subsets of the benchmarking data and all tools, except for Sourmash, found unique viral contigs. Tools performance could be improved with adjusted parameter cutoffs, indicating that adjustment of parameter cutoffs before usage should be considered. Together, our independent benchmarking provides guidance on choices of bioinformatic virus identification tools and gives suggestions for parameter adjustments for viromics researchers.

Publisher

Cold Spring Harbor Laboratory

Reference87 articles.

1. Revisiting the rules of life for viruses of microorganisms;Nat. Rev. Microbiol,2021

2. Species–function relationships shape ecological properties of the human gut microbiome

3. Deciphering the virus-to-prokaryote ratio (VPR): insights into virus–host relationships in a variety of ecosystems;Biol. Rev,2017

4. Single-cell genomics-based analysis of virus–host interactions in marine surface bacterioplankton

5. Marine viruses and their biogeochemical and ecological effects

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimization of screening methods leads to the discovery of new viruses in black soldier flies (Hermetia illucens);2024-08-27

2. Seasonal dynamics and diversity of Antarctic marine viruses reveal a novel viral seascape;2024-01-15

3. ProkBERT family: genomic language models for microbiome applications;Frontiers in Microbiology;2024-01-12

4. ProkBERT Family: Genomic Language Models for Microbiome Applications;2023-11-13

5. The International Virus Bioinformatics Meeting 2023;Viruses;2023-09-30