Abstract
AbstractWith the development of Next-Generation Sequencing (NGS) technologies it became possible to simultaneously analyze millions of variants. Despite the quality improvement it is generally still required to confirm the variants before reporting. However, in recent years the dominant idea is that one could define the quality thresholds for “high quality” variants which do not require orthogonal validation. Despite that, no works to date report the concordance between variants from whole genome sequencing and their gold-standard Sanger validation. In this study we analyzed the concordance for 1756 WGS variants in order to establish the appropriate thresholds for high-quality variants filtering. Resulting thresholds allowed us to drastically reduce the number of variants which require validation, to 5,6% and 1.2% of the initial set for caller-agnostic thresholds and caller-dependent QUAL threshold respectively.
Publisher
Cold Spring Harbor Laboratory