Family reunion via error correction: An efficient analysis of duplex sequencing data-Reference-Cited by-同舟云学术

Family reunion via error correction: An efficient analysis of duplex sequencing data

Published:2018-11-14 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Stoler Nicholas,Arbeithuber Barbara^ORCID,Povysil Gundula,Heinzl Monika,Salazar Renato,Makova Kateryna^ORCID,Tiemann-Boege Irene^ORCID,Nekrutenko Anton^ORCID

Abstract

AbstractDuplex sequencing is the most accurate approach for identification of sequence variants present at very low frequencies. Its power comes from pooling together multiple descendants of both strands of original DNA molecules, which allows distinguishing true nucleotide substitutions from PCR amplification and sequencing artifacts. This strategy comes at a cost—sequencing the same molecule multiple times increases dynamic range but significantly diminishes coverage, making whole genome duplex sequencing prohibitively expensive. Furthermore, every duplex experiment produces a substantial proportion of singleton reads that cannot be used in the analysis and are, technically, thrown away. In this paper we demonstrate that a significant fraction of these reads contains PCR or sequencing errors within duplex tags. Correction of such errors allows “reuniting” these reads with their respective families increasing the output of the method and making it more cost effective. Additionally, we combine error correction strategy with a number of algorithmic improvements in a new version of the duplex analysis software, Du Novo 2.0, readily available through Galaxy, Bioconda, and as the source code.

Publisher

Cold Spring Harbor Laboratory

Reference15 articles.

1. Fennell T , Homer N. 2018. fgbio. fulcrumgenomics https://github.com/fulcrumgenomics/fgbio (Accessed July 5, 2018).

2. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

3. Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features

4. Mei H , Arbeithuber B , Cremona M , DeGeorgio M , Nekrutenko A. 2018. A high resolution view of adaptive events. http://dx.doi.org/10.1101/429175.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Increased yields of duplex sequencing data by a series of quality control tools;2019-12-05