High-resolution shotgun metagenomics: the more data, the better?

Published:2022-10-30 Issue:6 Volume:23
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en

Author:

Tremblay Julien¹, Schreiber Lars¹, Greer Charles W¹

Affiliation:

1. Energy Mining and Environment Research Centre, National Research Council Canada, Montreal, QC, Canada H4P-2R2

Abstract

In shotgun metagenomics (SM), the state-of-the-art bioinformatic workflows are referred to as high-resolution shotgun metagenomics (HRSM) and require intensive computing and disk storage resources. While the increased data output of the latest generation of high-throughput DNA sequencing systems allows for unprecedented sequencing depth at minimal cost, HRSM workflows will need to be adjusted to properly process these ever-growing sequence datasets. One potential adaptation is to generate so-called shallow SM datasets, which contain less sequencing data per sample than classic high-coverage sequencing. While shallow sequencing is a promising avenue for SM data analysis, detailed benchmarks using real data are lacking. In this case study, we took four public SM datasets, one massive and the others moderate in size, and subsampled each dataset at various levels to mimic shallow sequencing datasets of various sequencing depths. Our results suggest that shallow SM sequencing is a viable avenue to obtain sound results regarding microbial community structures and that high-depth sequencing does not bring additional elements for ecological interpretation. More specifically, results obtained by subsampling as few as 0.5 M sequencing clusters per sample were similar to the results obtained with the largest subsampled dataset for human gut and agricultural soil datasets. For an Antarctic dataset, which contained only a few samples, 4 M sequencing clusters per sample were found to generate results comparable to the full dataset. One area where ultra-deep sequencing and maximizing the usage of all data was undeniably beneficial was in the generation of metagenome-assembled genomes.
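
The subsampling step described in the abstract (drawing a fixed number of sequencing clusters per sample to mimic a shallower run) could be sketched roughly as below. This is an illustration only, not the authors' workflow: the abstract does not say which tool or sampling scheme was used, and the names `read_fastq` and `subsample_clusters` are hypothetical. The sketch assumes uncompressed, unwrapped four-line FASTQ records with R1 and R2 files in the same order, and it uses single-pass reservoir sampling so that mates stay together.

```python
import io
import random
from itertools import islice
from typing import Iterable, Iterator, List, TextIO, Tuple

Record = Tuple[str, ...]          # the four lines of one FASTQ record
Cluster = Tuple[Record, Record]   # one cluster = the R1 and R2 mates


def read_fastq(handle: TextIO) -> Iterator[Record]:
    """Yield FASTQ records as 4-line tuples (assumes unwrapped records)."""
    while True:
        lines = list(islice(handle, 4))
        if len(lines) < 4:
            return
        yield tuple(line.rstrip("\n") for line in lines)


def subsample_clusters(pairs: Iterable[Cluster], n: int, seed: int = 42) -> List[Cluster]:
    """Reservoir-sample up to n clusters in one pass, keeping mates together."""
    rng = random.Random(seed)     # fixed seed makes the draw reproducible
    reservoir: List[Cluster] = []
    for i, pair in enumerate(pairs):
        if i < n:
            reservoir.append(pair)
        else:
            j = rng.randint(0, i)  # inclusive bounds
            if j < n:
                reservoir[j] = pair
    return reservoir


# Tiny in-memory demonstration with made-up reads (not real data).
r1 = io.StringIO("".join(f"@read{i}/1\nACGT\n+\nIIII\n" for i in range(20)))
r2 = io.StringIO("".join(f"@read{i}/2\nTGCA\n+\nIIII\n" for i in range(20)))
shallow = subsample_clusters(zip(read_fastq(r1), read_fastq(r2)), n=5)
```

Calling `subsample_clusters` with different values of `n` on the same input would give a series of nested-depth datasets comparable in spirit to the 0.5 M and 4 M cluster levels mentioned above. For real runs, an established tool would normally be preferable to a hand-rolled script.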

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems

Link

https://academic.oup.com/bib/article-pdf/23/6/bbac443/47244300/bbac443.pdf

References: 47 articles.

1. A review of methods and databases for metagenomic classification and assembly;Breitwieser;Brief Bioinform,2019

2. A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data;Yang;Comput Struct Biotechnol J,2021

3. Ray Meta: scalable de novo metagenome assembly and profiling;Boisvert;Genome Biol,2012

4. Critical Assessment of Metagenome Interpretation: a benchmark of metagenomics software;Sczyrba;Nat Methods,2017

Cited by 11 articles.

1. HPC-T-Annotator: an HPC tool for de novo transcriptome assembly annotation;BMC Bioinformatics;2024-08-21

2. Metagenome quality metrics and taxonomical annotation visualization through the integration of MAGFlow and BIgMAG;F1000Research;2024-06-17

3. Mock community taxonomic classification performance of publicly available shotgun metagenomics pipelines;Scientific Data;2024-01-17

4. Niche differentiation in microbial communities with stable genomic traits over time in engineered systems;The ISME Journal;2024-01-01

5. Intermittent water stress favors microbial traits that better help wheat under drought;ISME Communications;2024-01