SINTAX: a simple non-Bayesian taxonomy classifier for 16S and ITS sequences


Edgar, Robert C.


Abstract

Metagenomics experiments often characterize microbial communities by sequencing the ribosomal 16S and ITS regions. Taxonomy prediction is a fundamental step in such studies. The SINTAX algorithm predicts taxonomy by using k-mer similarity to identify the top hit in a reference database, and it provides a bootstrap confidence value for every rank in the prediction. SINTAX achieves accuracy comparable to or better than that of the RDP Naive Bayesian Classifier, using a simpler algorithm that does not require training. Most tested methods are shown to have high rates of over-classification errors, in which novel taxa are incorrectly predicted to have known names.
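The idea summarized in the abstract (top hit by k-mer similarity, plus bootstrap confidence per rank) can be illustrated with a minimal Python sketch. This is not the author's implementation. The abstract does not give parameter values or the exact scoring rule, so everything specific here is an assumption made for illustration: the function names (`kmers`, `top_hit`, `classify`), the word length `k=8`, the subsample size of 32 words, the 100 iterations, sampling without replacement, and defining confidence as the fraction of iterations whose top hit agrees with the prediction down to a given rank.

```python
import random


def kmers(seq, k):
    """Return the set of distinct k-mers (words) in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def top_hit(words, ref_index):
    """Return the id of the reference sharing the most words with `words`.

    Ties go to the reference that appears first in ref_index.
    """
    best_id, best_score = None, -1
    for ref_id, ref_words in ref_index.items():
        score = len(words & ref_words)
        if score > best_score:
            best_id, best_score = ref_id, score
    return best_id


def classify(query, refs, k=8, subsample=32, iterations=100, seed=0):
    """Predict taxonomy for `query` with a per-rank bootstrap confidence.

    refs maps a reference id to (sequence, taxonomy), where taxonomy is a
    tuple of names ordered from the highest rank down.
    All default parameter values are assumptions for this sketch.
    """
    rng = random.Random(seed)
    ref_index = {rid: kmers(seq, k) for rid, (seq, _) in refs.items()}
    query_words = sorted(kmers(query, k))  # sorted for reproducible sampling
    if not query_words:
        raise ValueError("query is shorter than k")

    # Prediction: taxonomy of the top hit when all query words are used.
    predicted = refs[top_hit(set(query_words), ref_index)][1]

    # Bootstrap: repeat the search on random subsets of the query words and
    # count how often the top hit agrees with the prediction down to each rank.
    agree = [0] * len(predicted)
    n = min(subsample, len(query_words))
    for _ in range(iterations):
        hit = top_hit(set(rng.sample(query_words, n)), ref_index)
        hit_tax = refs[hit][1]
        for i in range(len(predicted)):
            if hit_tax[:i + 1] == predicted[:i + 1]:
                agree[i] += 1

    return [(name, count / iterations) for name, count in zip(predicted, agree)]
```

No model is trained in this sketch: the only preprocessing is building one k-mer set per reference sequence, which is consistent with the abstract's point that the method does not require training. A caller would typically truncate the returned list at the first rank whose confidence falls below a chosen cutoff; that cutoff is a user choice and is not specified in the abstract.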


Cold Spring Harbor Laboratory







