REViewer: haplotype-resolved visualization of read alignments in and around tandem repeats
Published: 2022-08-11
Issue: 1
Volume: 14
ISSN: 1756-994X
Container-title: Genome Medicine
Short-container-title: Genome Med
Language: en
Author:
Dolzhenko Egor, Weisburd Ben, Ibañez Kristina, Rajan-Babu Indhu-Shree, Anyansi Christine, Bennett Mark F., Billingsley Kimberley, Carroll Ashley, Clamons Samuel, Danzi Matt C., Deshpande Viraj, Ding Jinhui, Fazal Sarah, Halman Andreas, Jadhav Bharati, Qiu Yunjiang, Richmond Phillip A., Saunders Christopher T., Scheffler Konrad, van Vugt Joke J. F. A., Zwamborn Ramona R. A. J., Chong Samuel S., Friedman Jan M., Tucci Arianna, Rehm Heidi L., Eberle Michael A.
Abstract
Background
Expansions of short tandem repeats cause many neurogenetic disorders, including familial amyotrophic lateral sclerosis and Huntington disease. Several recently developed methods can identify repeat expansions in whole-genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools cannot produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligned and ambiguously aligned reads.
Results
We implemented REViewer, a computational method for visualizing sequencing data in genomic regions containing long repeat expansions, and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in the way that is most consistent with the fragment lengths and the evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study with 12 scientists working in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer, together with a guide to the typical features of read pileups that correspond to low-confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results for 44 FMR1 repeat alleles with the results of triplet repeat-primed PCR. For 38 of these 44 alleles, the results of visual assessment were consistent with triplet repeat-primed PCR.
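The read-distribution idea described above can be illustrated with a toy sketch. This is not REViewer's implementation: the greedy strategy, the normal fragment-length model, the function names (`fragment_log_likelihood`, `assign_pairs`), and all parameter values are assumptions made only for illustration. Each read pair is given the fragment length it would imply on each of two haplotypes (or `None` if it does not align there), and is placed on the haplotype that best balances a plausible fragment length against even coverage.

```python
import math
from typing import List, Optional, Sequence


def fragment_log_likelihood(length: float, mean: float, sd: float) -> float:
    """Log density of an assumed normal fragment-length model."""
    z = (length - mean) / sd
    return -0.5 * z * z - math.log(sd * math.sqrt(2.0 * math.pi))


def assign_pairs(
    pairs: Sequence[Sequence[Optional[float]]],
    mean: float = 450.0,            # hypothetical mean fragment length
    sd: float = 50.0,               # hypothetical standard deviation
    evenness_weight: float = 0.1,   # hypothetical penalty per read already placed
) -> List[Optional[int]]:
    """Greedily assign each read pair to haplotype 0 or 1.

    pairs[i][h] is the fragment length pair i would have on haplotype h,
    or None if the pair cannot be aligned to that haplotype.
    """
    counts = [0, 0]  # number of pairs placed on each haplotype so far
    assignment: List[Optional[int]] = []
    for frag_lengths in pairs:
        best_hap: Optional[int] = None
        best_score = -math.inf
        for hap, length in enumerate(frag_lengths):
            if length is None:
                continue
            # Favor plausible fragment lengths; penalize the fuller haplotype.
            score = (fragment_log_likelihood(length, mean, sd)
                     - evenness_weight * counts[hap])
            if score > best_score:  # ties keep the lower-numbered haplotype
                best_hap, best_score = hap, score
        assignment.append(best_hap)
        if best_hap is not None:
            counts[best_hap] += 1
    return assignment


if __name__ == "__main__":
    toy_pairs = [(450, 600), (700, 460), (450, 450), (450, 450)]
    print(assign_pairs(toy_pairs))  # [0, 1, 0, 1]
```

In this toy input, the first two pairs are placed by fragment length alone, while the last two fit both haplotypes equally well and are split between them by the coverage penalty.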
Conclusions
Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls and to visually detect interruptions or other imperfections in the repeat sequence and its flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook, respectively.
Publisher
Springer Science and Business Media LLC
Subject
Genetics (clinical), Genetics, Molecular Biology, Molecular Medicine