Evaluation of 10 Different Pipelines for Bacterial Single-Nucleotide Variant Detection

Author:

Hu Zi-Hao,Wang Ying1,Yang Long2,Cao Qing-Yi3,Ling Ming4,Meng Xiao-Hua3,Chen Yao3,Ni Shu-Jun3,Chen Zhi3,Liu Cheng-Zhi1,Su Kun-Kai3

Affiliation:

1. Hangzhou Digital-Micro Biotech Co, Ltd, Hangzhou, China

2. Jinan Microecological Biomedicine Shandong Laboratory, Jinan, China

3. State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, the First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, China

4. Jinhua Institute for Food and Drug Control, Jinhua, China.

Abstract

Abstract Bacterial genome sequencing is a powerful technique for studying the genetic diversity and evolution of microbial populations. However, the detection of genomic variants from sequencing data is challenging due to the presence of contamination, sequencing errors and multiple strains within the same species. Several bioinformatics tools have been developed to address these issues, but their performance and accuracy have not been systematically evaluated. In this study, we compared 10 variant detection pipelines using 18 simulated and 17 real datasets of high-throughput sequences from a bundle of representative bacteria. We assessed the sensitivity of each pipeline under different conditions of coverage, simulation and strain diversity. We also demonstrated the application of these tools to identify consistent mutations in a 30-time repeated sequencing dataset of Staphylococcus hominis. We found that HaplotypeCaller, but not Mutect2, from the GATK tool set showed the best performance in terms of accuracy and robustness. CFSAN and Snippy performed not as well in several simulated and real sequencing datasets. Our results provided a comprehensive benchmark and guidance for choosing the optimal variant detection pipeline for high-throughput bacterial genome sequencing data.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Microbiology (medical),Infectious Diseases,Epidemiology

Reference46 articles.

1. In silico evaluation of variant calling methods for bacterial whole-genome sequencing assays;J Clin Microbiol,2023

2. Children with infectious pneumonia caused by Ralstonia insidiosa: a case report;World J Clin Cases,2023

3. Cronobacter sakazakii induces necrotizing enterocolitis by regulating NLRP3 inflammasome expression via TLR4;J Med Microbiol,2020

4. Ceftolozane/tazobactam heteroresistance in cystic fibrosis–related Pseudomonas aeruginosa infections;JAC Antimicrob Resist,2023

5. Evaluation of SNP calling methods for closely related bacterial isolates and a novel high-accuracy pipeline: BactSNP;Microb Genom,2019

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3