Abstract
We present Samovar, a mosaic single-nucleotide variant (SNV) caller for linked-read whole-genome shotgun sequencing data. Samovar scores candidate sites using a random forest model, trained on the input dataset, that considers read quality, phasing, and linked-read characteristics. We show that Samovar calls mosaic SNVs within a single sample with accuracy comparable to what previously required trios or matched tumor/normal pairs, and that it outperforms single-sample mosaic variant callers at MAF 5%-50% with at least 30x coverage. Furthermore, we use Samovar to find somatic variants in whole-genome sequencing of both tumor and normal samples from 13 pediatric cancer cases; these variants are corroborated with high recall by whole-exome sequencing. Samovar is available open-source at https://github.com/cdarby/samovar under the MIT license.
Publisher
Cold Spring Harbor Laboratory