Comprehensive fundamental somatic variant calling and quality management strategies for human cancer genomes-Reference-Cited by-同舟云学术

Comprehensive fundamental somatic variant calling and quality management strategies for human cancer genomes

Published:2020-06-08 Issue:3 Volume:22 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

He Xiaoyu^ORCID,Chen Shanyu,Li Ruilin,Han Xinyin,He Zhipeng,Yuan Danyang,Zhang Shuying,Duan Xiaohong,Niu Beifang

Abstract

Abstract Next-generation sequencing (NGS) technology has revolutionised human cancer research, particularly via detection of genomic variants with its ultra-high-throughput sequencing and increasing affordability. However, the inundation of rich cancer genomics data has resulted in significant challenges in its exploration and translation into biological insights. One of the difficulties in cancer genome sequencing is software selection. Currently, multiple tools are widely used to process NGS data in four stages: raw sequence data pre-processing and quality control (QC), sequence alignment, variant calling and annotation and visualisation. However, the differences between these NGS tools, including their installation, merits, drawbacks and application, have not been fully appreciated. Therefore, a systematic review of the functionality and performance of NGS tools is required to provide cancer researchers with guidance on software and strategy selection. Another challenge is the multidimensional QC of sequencing data because QC can not only report varied sequence data characteristics but also reveal deviations in diverse features and is essential for a meaningful and successful study. However, monitoring of QC metrics in specific steps including alignment and variant calling is neglected in certain pipelines such as the ‘Best Practices Workflows’ in GATK. In this review, we investigated the most widely used software for the fundamental analysis and QC of cancer genome sequencing data and provided instructions for selecting the most appropriate software and pipelines to ensure precise and efficient conclusions. We further discussed the prospects and new research directions for cancer genomics.

Funder

National Natural Science Foundation of China

National Key R&D Program of China

Chinese Academy of Sciences

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

http://academic.oup.com/bib/article-pdf/22/3/bbaa083/37965947/bbaa083.pdf

Reference102 articles.

1. An integrative variant analysis suite for whole exome next-generation sequencing data;Challis;BMC Bioinform,2012