Integrated variant allele frequency analysis pipeline and R package: easyVAF

Author:

Hu Junxiao (1, 2); Alami Vida (1); Zhuang Yonghua (1, 2); Alzofon Nathaniel (3); Jimeno Antonio (3); Gao Dexiang (1, 2)

Affiliation:

1. Biostatistics Shared Resource (RRID: SCR_021981), University of Colorado Cancer Center, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA

2. Department of Pediatrics, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA

3. Division of Medical Oncology, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA

Abstract

Somatic sequence variants are associated with cancer diagnosis, prognostic stratification, and treatment response. Variant allele frequency (VAF), the percentage of sequence reads at a locus that carry a specific DNA variant (variant reads divided by the read depth at that locus), has been used as a metric to quantify mutation rates in these applications. VAF has the potential for feature detection because it reflects changes in tumor clonal composition across treatments or time points. Although several packages, including Genome Analysis Toolkit and VarScan, are designed for variant calling and rare mutation identification, there is no readily available package for comparing VAFs among and between groups to identify loci of interest. To this end, we have developed the R package easyVAF, which includes parametric and nonparametric tests to compare VAFs among multiple groups. It is accompanied by an interactive R Shiny app. With easyVAF, the investigator can choose among three statistical tests to maximize power while maintaining an acceptable type I error rate. This paper presents our proposed pipeline for VAF analysis, from quality checking to group comparison. We evaluate our method in a wide range of simulated scenarios and show that choosing the appropriate test to limit the type I error rate is critical. For situations where data are sparse, we recommend comparing VAFs with the beta‐binomial likelihood ratio test over Fisher's exact test and Pearson's χ2 test.
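To make the VAF definition and two of the named tests concrete, the following is a minimal Python sketch, not the easyVAF API (easyVAF is an R package whose function names are not given in this abstract). It assumes read counts pooled within each of two groups at a single locus, arranged as a 2x2 table of variant and reference reads. All function names here are hypothetical, and the beta-binomial likelihood ratio test recommended for sparse data is not shown.

```python
from math import comb, erfc, sqrt


def vaf(variant_reads, depth):
    """VAF = reads carrying the variant / read depth at the locus."""
    if depth <= 0:
        raise ValueError("read depth must be positive")
    return variant_reads / depth


def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-square test (no continuity correction) for [[a, b], [c, d]].

    Rows are groups; columns are variant and reference read counts.
    Returns (statistic, p_value) with 1 degree of freedom.
    """
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be positive")
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # Survival function of a chi-square variable with 1 df.
    return stat, erfc(sqrt(stat / 2))


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact test for [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Small relative tolerance guards against floating-point ties.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-7))
    return min(1.0, p)


if __name__ == "__main__":
    # Illustrative counts only (not data from the paper).
    group_a = (12, 200)  # (variant reads, read depth)
    group_b = (30, 210)
    table = (group_a[0], group_a[1] - group_a[0],
             group_b[0], group_b[1] - group_b[0])
    print("VAF A:", vaf(*group_a), "VAF B:", vaf(*group_b))
    print("Pearson chi-square:", pearson_chi2_2x2(*table))
    print("Fisher exact p:", fisher_exact_2x2(*table))
```

Pooling reads within a group, as this sketch does, ignores sample-to-sample variability in VAF; that extra variability is the kind of overdispersion a beta-binomial model is designed to capture, which is consistent with the abstract's recommendation for sparse data.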

Publisher

Wiley

Subject

Cancer Research,Molecular Biology
