Benchmarking UMI-aware and standard variant callers on synthetic and real ctDNA datasets

Author:

Maruzani Rugare1,Brierley Liam1,Jorgensen Andrea1,Fowler Anna1

Affiliation:

1. University of Liverpool

Abstract

Abstract Background Circulating tumour DNA (ctDNA) is a subset of cell free DNA (cfDNA) released by tumour cells into the bloodstream. Circulating tumour DNA has shown great potential as a biomarker to inform treatment in cancer patients. Collecting ctDNA is minimally invasive and reflects the entire genetic makeup of a patient’s cancer. ctDNA variants in NGS data can be difficult to distinguish from sequencing and PCR artefacts due to low abundance, particularly in the early stages of cancer. Unique Molecular Identifiers (UMIs) are short sequences ligated to the sequencing library before amplification. These sequences are useful for filtering out low frequency artefacts. The utility of ctDNA as a cancer biomarker depends on accurate detection of cancer variants. Results In this study, we benchmarked six variant calling tools, including two UMI-aware callers for their ability to call ctDNA variants. The standard variant callers tested included Mutect2, bcftools, LoFreq and FreeBayes. The UMI-aware variant callers benchmarked were UMI-VarCal and UMIErrorCorrect. We used both real and synthetic datasets, with and without UMI sequences. Variant callers displayed different preferences for sensitivity and specificity. Mutect2 showed high sensitivity, while returning more privately called variants than any other caller in non-UMI data – an indicator of false positive variant discovery. In UMI encoded data, UMI-VarCal detected fewer putative false positive variants than all other callers in synthetic datasets. UMI-VarCal also called the highest percentage of COSMIC variants in real samples, and only 4.4% uniquely called variants indicating high sensitivity and specificity. Conclusions Our results indicate UMI-aware variant callers have potential to improve sensitivity and specificity in calling ctDNA variants over standard variant calling tools. There is a growing need for further development of UMI-aware variant calling tools if effective early detection methods for cancer using ctDNA samples are to be realised.

Publisher

Research Square Platform LLC

Reference43 articles.

1. Genome-wide cell-free DNA fragmentation in patients with cancer;Cristiano S;Nature,2019

2. Circulating tumor DNA methylation profiles enable early diagnosis, prognosis prediction, and screening for colorectal cancer;Luo H;Sci Transl Med,2020

3. Finding new cancer epigenetic and genetic biomarkers from cell-free DNA by combining SALP-seq and machine learning;Liu S;Comput Struct Biotechnol J,2020

4. Can tissue biopsy be replaced by liquid biopsy?;Ilié M;Transl Lung Cancer Res,2016

5. Whole exome sequencing of cell-free DNA – A systematic review and Bayesian individual patient data meta-analysis;Bos MK;Cancer Treat Rev,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3