Benchmarking Low-Frequency Variant Calling With Long-Read Data on Mitochondrial DNA

Published: 2022-05-19 Volume: 13
ISSN: 1664-8021
Container-title: Frontiers in Genetics
Short-container-title: Front. Genet.

Author:

Lüth Theresa, Schaake Susen, Grünewald Anne, May Patrick, Trinh Joanne, Weissensteiner Hansi

Abstract

Background: Sequencing quality has improved over the last decade for long-reads, allowing for more accurate detection of somatic low-frequency variants. In this study, we used mixtures of mitochondrial samples with different haplogroups (i.e., a specific set of mitochondrial variants) to investigate the applicability of nanopore sequencing for low-frequency single nucleotide variant detection.

Methods: We investigated the impact of base-calling, alignment/mapping, quality control steps, and variant calling by comparing the results to a previously derived short-read gold standard generated on the Illumina NextSeq. For nanopore sequencing, six mixtures of four different haplotypes were prepared, allowing us to reliably check for expected variants at the predefined 5%, 2%, and 1% mixture levels. We used two different versions of Guppy for base-calling, two aligners (i.e., Minimap2 and Ngmlr), and three variant callers (i.e., Mutserve2, Freebayes, and Nanopanel2) to compare low-frequency variants. We used F1 score measurements to assess the performance of variant calling.

Results: We observed a mean read length of 11 kb and a mean overall read quality of 15. Ngmlr showed not only higher F1 scores but also higher allele frequencies (AF) of false-positive calls across the mixtures (mean F1 score = 0.83; false-positive AF < 0.17) compared to Minimap2 (mean F1 score = 0.82; false-positive AF < 0.06). Mutserve2 had the highest F1 scores (5% level: F1 score >0.99, 2% level: F1 score >0.54, and 1% level: F1 score >0.70) across all callers and mixture levels.

Conclusion: We here present the benchmarking for low-frequency variant calling with nanopore sequencing by identifying current limitations.
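The abstract reports F1 scores without restating how the measure is defined. As a minimal sketch, assuming the standard definition (the harmonic mean of precision and recall), the score for one mixture could be computed by comparing the set of called variants against the set of expected variants. The function name `f1_score` and the variant positions below are hypothetical illustrations, not the study's pipeline or data.

```python
def f1_score(expected, called):
    """Return the F1 score of a call set against an expected variant set.

    Both arguments are iterables of hashable variant keys,
    here (position, alternate_base) tuples (an assumed representation).
    """
    expected, called = set(expected), set(called)
    tp = len(expected & called)   # expected variants that were called
    fp = len(called - expected)   # calls that were not expected
    fn = len(expected - called)   # expected variants that were missed
    if tp == 0:
        return 0.0                # avoids division by zero when nothing matches
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: four expected variants, three found, one false positive.
expected = {(73, "G"), (263, "G"), (750, "G"), (1438, "G")}
called = {(73, "G"), (263, "G"), (750, "G"), (16519, "C")}
score = f1_score(expected, called)
print(f"F1 = {score:.2f}")
```

With three true positives, one false positive, and one false negative, precision and recall are both 0.75, so the F1 score is also 0.75.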

Publisher

Frontiers Media SA

Subject

Genetics (clinical), Genetics, Molecular Medicine

Reference: 53 articles.

1. Detection of Ultra-rare Mitochondrial Mutations in Breast Stem Cells by Duplex Sequencing;Ahn;PLoS One,2015

2. long-read-tools.org: an Interactive Catalogue of Analysis Methods for Long-Read Sequencing Data;Amarasinghe;Gigascience,2021

3. Mitochondria in Neuroinflammation - Multiple Sclerosis (MS), Leber Hereditary Optic Neuropathy (LHON) and LHON-MS;Bargiela;Neurosci. Lett.,2019

4. Calling Somatic SNVs and Indels with Mutect2;Benjamin;bioRxiv,2019

5. Single-molecule Mitochondrial DNA Sequencing Shows No Evidence of CpG Methylation in Human Cells and Tissues;Bicci;Nucleic Acids Res.,2021

Cited by 7 articles.

1. mtDNA-Server 2: advancing mitochondrial DNA analysis through highly parallelized data processing and interactive analytics;Nucleic Acids Research;2024-05-06

2. North and East African mitochondrial genetic variation needs further characterization towards precision medicine;Journal of Advanced Research;2023-12

3. Evaluating the performance of low-frequency variant calling tools for the detection of variants from short-read deep sequencing data;Scientific Reports;2023-11-22

4. CmVCall: An automated and adjustable nanopore analysis pipeline for heteroplasmy detection of the control region in human mitochondrial genome;Forensic Science International: Genetics;2023-11

5. POLG2-Linked Mitochondrial Disease: Functional Insights from New Mutation Carriers and Review of the Literature;The Cerebellum;2023-04-22