Systematic comparison of variant calling pipelines of target genome sequencing cross multiple next-generation sequencers

Author:

Feng Baosheng1,Lai Juan1,Fan Xue2,Guo Xin2,Liu Yongfeng1,Wang Miao1,Wu Ping1,Zhou Zhiliang1,Yan Qin1,Sun Lei1

Affiliation:

1. GeneMind Biosciences Company Limited

2. Longgang District Maternity&Child Healthcare Hospital of Shenzhen City

Abstract

Abstract Background Targeted genomic sequencing (TS) greatly benefits precision oncology by rapidly detecting genetic variations with better accuracy and sensitivity owing to its high sequencing depth. Multiple sequencing platforms and variant calling tools are available for TS, making it excruciating for researchers to choose. Therefore, benchmarking study across different platforms and pipelines available for TS is imperative. In this study, we performed a TSof Reference OncoSpan FFPE (HD832) sample enriched by TSO500 panel using four commercially available sequencers, and analyzed the output 50 datasets using five commonly-used bioinformatics pipelines. We systematically investigated the sequencing quality and variant detection sensitivity, expecting to provide optimal recommendations for future research. Results Four sequencing platforms returned highly concordant results in terms of base quality (Q20>94%), sequencing coverage (>97%) and depth (>2000×). Benchmarking revealed good concordance of variant calling across different platforms and pipelines, among which, FASTASeq 300 platform showed the highest sensitivity (100%) in high-confidence variants calling when analyzed by SNVer and VarScan 2 algorithms. Furthermore, this sequencer demonstrated the shortest sequencing time (~21 hr) at the sequencing mode PE150. Through the intersection of 50 datasets generated in this study, we recommended a novel set of variant genes outside the truth set published by HD832, expecting to replenish HD832 for future research of tumor variant diagnosis. Considering the dissimilarity of variant calls across different pipelines for datasets from the same platform, we recommended an integration of multiple tools to improve variant calling sensitivity and accuracy for the cancer genome. Conclusions Illumina and GeneMind technologies can be used independently or together by public health laboratories performing tumor TS. FASTASeq 300 platform performs better regarding variant detection sensitivity under SNVer and VarScan 2 algorithms along with the shortest turnaround time. Our study provides a standardized target sequencing resource to benchmark new bioinformatics protocols and sequencing platforms.

Publisher

Research Square Platform LLC

Reference22 articles.

1. Applications and analysis of targeted genomic sequencing in cancer studies;Bewicke-Copley F;Comput Struct Biotechnol J,2019

2. Next generation sequencing-based gene panel tests for the management of solid tumors;Nagahashi M;Cancer Sci,2019

3. Utility of incorporating next-generation sequencing (NGS) in an Asian non-small cell lung cancer (NSCLC) population: Incremental yield of actionable alterations and cost-effectiveness analysis;Tan AC;Lung Cancer,2020

4. TruSight Oncology 500 Portfolio. https://www.illumina.com/products/by-brand/trusight-oncology/tso-500-portfolio.html. Accessed 27 April 2023.

5. Zhao C, Jiang T, Ju JH, Zhang S, Tao J, Fu Y, Lococo J, Dockter J, Pawlowski T, Bilke S. TruSight oncology 500 enabling comprehensive genomic profiling and biomarker reporting with targeted sequencing. BioRxiv. 2020.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3