Dental plaque microbiota sequence counts for microbial profiling and resistance genes detection
-
Published:2024-05-06
Issue:1
Volume:108
Page:
-
ISSN:0175-7598
-
Container-title:Applied Microbiology and Biotechnology
-
language:en
-
Short-container-title:Appl Microbiol Biotechnol
Author:
Veschetti Laura, Paiella Salvatore, Carelli Maria, Zotti Francesca, Secchettin Erica, Malleo Giuseppe, Signoretto Caterina, Zulianello Giorgia, Nocini Riccardo, Crovetto Anna, Salvia Roberto, Bassi Claudio, Malerba GiovanniORCID
Abstract
Abstract
Shotgun metagenomics sequencing experiments are finding a wide range of applications. Nonetheless, there are still limited guidelines regarding the number of sequences needed to acquire meaningful information for taxonomic profiling and antimicrobial resistance gene (ARG) identification. In this study, we explored this issue in the context of oral microbiota by sequencing with a very high number of sequences (~ 100 million), four human plaque samples, and one microbial community standard and by evaluating the performance of microbial identification and ARGs detection through a downsampling procedure. When investigating the impact of a decreasing number of sequences on quantitative taxonomic profiling in the microbial community standard datasets, we found some discrepancies in the identified microbial species and their abundances when compared to the expected ones. Such differences were consistent throughout downsampling, suggesting their link to taxonomic profiling methods limitations. Overall, results showed that the number of sequences has a great impact on metagenomic samples at the qualitative (i.e., presence/absence) level in terms of loss of information, especially in experiments having less than 40 million reads, whereas abundance estimation was minimally affected, with only slight variations observed in low-abundance species. The presence of ARGs was also assessed: a total of 133 ARGs were identified. Notably, 23% of them inconsistently resulted as present or absent across downsampling datasets of the same sample. Moreover, over half of ARGs were lost in datasets having less than 20 million reads. This study highlights the importance of carefully considering sequencing aspects and suggests some guidelines for designing shotgun metagenomics experiments with the final goal of maximizing oral microbiome analyses. Our findings suggest varying optimized sequence numbers according to different study aims: 40 million for microbiota profiling, 50 million for low-abundance species detection, and 20 million for ARG identification.
Key points
• Forty million sequences are a cost-efficient solution for microbiota profiling
• Fifty million sequences allow low-abundance species detection
• Twenty million sequences are recommended for ARG identification
Graphical Abstract
Funder
FIMP Università degli Studi di Verona
Publisher
Springer Science and Business Media LLC
Reference26 articles.
1. Andrews S (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data [Online]. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ 2. Beghini F, McIver LJ, Blanco-Míguez A, Dubois L, Asnicar F, Maharjan S, Mailyan A, Manghi P, Scholz M, Thomas AM, Valles-Colomer M, Weingart G, Zhang Y, Zolfo M, Huttenhower C, Franzosa EA, Segata N (2021) Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. ELife 10:e65088. https://doi.org/10.7554/eLife.65088 3. Berg G, Rybakova D, Fischer D, Cernava T, Vergès M-CC, Charles T, Chen X, Cocolin L, Eversole K, Corral GH, Kazou M, Kinkel L, Lange L, Lima N, Loy A, Macklin JA, Maguin E, Mauchline T, McClure R, Mitter B, Ryan M, Sarand I, Smidt H, Schelkle B, Roume H, Kiran GS, Selvin J, de Souza RSC, van Overbeek L, Singh BK, Wagner M, Walsh A, Sessitsch A, Schloter M (2020) Microbiome definition re-visited: old concepts and new challenges. Microbiome 8:103. https://doi.org/10.1186/s40168-020-00875-0 4. Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170 5. Campanaro S, Treu L, Kougias PG, Zhu X, Angelidaki I (2018) Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies. Sci Rep 8:1926. https://doi.org/10.1038/s41598-018-20414-0
|
|