Fast and accurate DNASeq Variant Calling workflow composed of LUSH toolkit

Author:

Wang TaifuORCID,Zhang Youjin,Wang Haoling,Zheng Qiwen,Yang Jiaobo,Zhang Tiefeng,Sun Geng,Liu Weicong,Yin Longhui,He Xinqiu,You Rui,Wang Chu,Liu Zhencheng,Liu Zhijian,Wang Jin’an,Jin Xiangqian,He Zengquan

Abstract

AbstractBackgroundWhole genome sequencing (WGS) is becoming increasingly prevalent for molecular diagnosis, staging and prognosis because of its declining costs and the ability to detect nearly all genes associated with a patient’s disease. The currently widely accepted variant calling pipeline, GATK, is limited in terms of its computational speed and efficiency, which cannot meet the growing analysis needs.MethodsIn this study, we propose a fast and accurate DNASeq variant calling workflow that is purely composed of tools from LUSH toolkit. The LUSH pipeline is highly optimized for the WGS pipeline based on SOAPnuke, BWA and GATK which can be deployed on any general-purpose CPU-based computing system. We validated the accuracy, speed and scalability of the LUSH pipeline on several standard WGS datasets.ResultsOur test results show that the LUSH pipeline and the GATK pipeline are highly consistent in terms of accuracy, achieving over 99% precision and recall on NA12878. For speed, the LUSH pipeline completes 30x WGS data in 1.6 hours, which is about 17x faster than the GATK pipeline. From BAM to VCF, LUSH_HC even takes only 12 minutes, about 76x faster than GATK. Moreover, the LUSH pipeline shows favorable scalability in terms of thread and sequencing depth.ConclusionThe LUSH pipeline provides far superior computational speed to GATK while maintaining a high level of accuracy comparable to that of GATK, which greatly facilitates bedside analysis of acute patients, large-scale cohort data analysis, and variant calling in crop breeding programs.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3