Accuracy of taxonomy prediction for 16S rRNA and fungal ITS sequences

Published:2018-04-18 Volume:6 Page:e4652
ISSN:2167-8359
Container-title:PeerJ
language:en
Author:

Edgar Robert C.

Abstract

Prediction of taxonomy for marker gene sequences such as 16S ribosomal RNA (rRNA) is a fundamental task in microbiology. Most experimentally observed sequences are diverged from reference sequences of authoritatively named organisms, creating a challenge for prediction methods. I assessed the accuracy of several algorithms using cross-validation by identity, a new benchmark strategy which explicitly models the variation in distances between query sequences and the closest entry in a reference database. When the accuracy of genus predictions was averaged over a representative range of identities with the reference database (100%, 99%, 97%, 95% and 90%), all tested methods had ≤50% accuracy on the currently-popular V4 region of 16S rRNA. Accuracy was found to fall rapidly with identity; for example, better methods were found to have V4 genus prediction accuracy of ∼100% at 100% identity but ∼50% at 97% identity. The relationship between identity and taxonomy was quantified as the probability that a rank is the lowest shared by a pair of sequences with a given pair-wise identity. With the V4 region, 95% identity was found to be a twilight zone where taxonomy is highly ambiguous because the probabilities that the lowest shared rank between pairs of sequences is genus, family, order or class are approximately equal.
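The last quantity in the abstract, the probability that a given rank is the lowest one shared by a pair of sequences at a given pairwise identity, can be illustrated with a minimal sketch. The rank list, the dictionary layout for lineages, the whole-percent binning and the function names below are assumptions made for illustration; they are not taken from the paper, and pairwise identities are assumed to have been computed elsewhere.

```python
from collections import Counter, defaultdict

# Assumed rank order, from most general to most specific.
RANKS = ["domain", "phylum", "class", "order", "family", "genus", "species"]


def lowest_shared_rank(tax_a, tax_b):
    """Return the most specific rank at which two lineages still agree.

    Each lineage is a dict mapping rank name -> taxon name. Returns None
    when the lineages already differ (or are unannotated) at the top rank.
    """
    shared = None
    for rank in RANKS:
        name_a, name_b = tax_a.get(rank), tax_b.get(rank)
        if name_a is None or name_a != name_b:
            break
        shared = rank
    return shared


def lowest_shared_rank_probabilities(pairs):
    """Estimate P(lowest shared rank | identity bin) from annotated pairs.

    pairs: iterable of (identity_percent, tax_a, tax_b).
    Identities are binned by truncating to a whole percent (an assumption).
    Returns {identity_bin: {rank: fraction of pairs in that bin}}.
    """
    counts = defaultdict(Counter)
    for identity, tax_a, tax_b in pairs:
        counts[int(identity)][lowest_shared_rank(tax_a, tax_b)] += 1
    return {
        bin_: {rank: n / sum(c.values()) for rank, n in c.items()}
        for bin_, c in counts.items()
    }
```

Under this reading, a "twilight zone" at a given identity would show up as a bin in which several ranks (for example genus, family, order and class) receive roughly equal fractions.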

Publisher

PeerJ

Subject

General Agricultural and Biological Sciences; General Biochemistry, Genetics and Molecular Biology; General Medicine; General Neuroscience

Link

https://peerj.com/articles/4652.pdf

References: 55 articles.

1. SPINGO: a rapid species-classifier for microbial amplicon sequences;Allard;BMC Bioinformatics,2015

2. Basic local alignment search tool;Altschul;Journal of Molecular Biology,1990

3. metaxa2: improved identification and taxonomic classification of small and large subunit rRNA in metagenomic data;Bengtsson-Palme;Molecular Ecology Resources,2015

4. Trade-offs between microbiome diversity and productivity in a stratified microbial mat;Bernstein;ISME Journal,2017

5. Optimizing taxonomic classification of marker gene;Bokulich;PeerJ Preprints,2017

Cited by 230 articles.

1. Characterization of core microbiota of barley seeds from different continents for origin tracing and quarantine pathogen assessment;Food Microbiology;2024-12

2. Modelling soil prokaryotic traits across environments with the trait sequence database ampliconTraits and the R package MicEnvMod;Ecological Informatics;2024-11

3. Specific gut microbiome’s role in skin pigmentation: insights from SCARB1 mutants in Oujiang colour common carp;Journal of Applied Microbiology;2024-09

4. Taxonomic composition and functional potentials of gastrointestinal microbiota in 12 wild-stranded cetaceans;Frontiers in Microbiology;2024-08-29

5. Testing the sequence of successional processes in miniature ecosystems;Microbiology Spectrum;2024-08-27