UnCoVar: a reproducible and scalable workflow for transparent and robust virus variant calling and lineage assignment using SARS-CoV-2 as an example
-
Published:2024-06-28
Issue:1
Volume:25
Page:
-
ISSN:1471-2164
-
Container-title:BMC Genomics
-
language:en
-
Short-container-title:BMC Genomics
Author:
Thomas Alexander,Battenfeld Thomas,Kraiselburd Ivana,Anastasiou Olympia,Dittmer Ulf,Dörr Ann-Kathrin,Dörr Adrian,Elsner Carina,Gosch Jule,Le-Trilling Vu Thuy Khanh,Magin Simon,Scholtysik René,Yilmaz Pelin,Trilling Mirko,Schöler Lara,Köster Johannes,Meyer Folker
Abstract
Abstract
Background
At a global scale, the SARS-CoV-2 virus did not remain in its initial genotype for a long period of time, with the first global reports of variants of concern (VOCs) in late 2020. Subsequently, genome sequencing has become an indispensable tool for characterizing the ongoing pandemic, particularly for typing SARS-CoV-2 samples obtained from patients or environmental surveillance. For such SARS-CoV-2 typing, various in vitro and in silico workflows exist, yet to date, no systematic cross-platform validation has been reported.
Results
In this work, we present the first comprehensive cross-platform evaluation and validation of in silico SARS-CoV-2 typing workflows. The evaluation relies on a dataset of 54 patient-derived samples sequenced with several different in vitro approaches on all relevant state-of-the-art sequencing platforms. Moreover, we present UnCoVar, a robust, production-grade reproducible SARS-CoV-2 typing workflow that outperforms all other tested approaches in terms of precision and recall.
Conclusions
In many ways, the SARS-CoV-2 pandemic has accelerated the development of techniques and analytical approaches. We believe that this can serve as a blueprint for dealing with future pandemics. Accordingly, UnCoVar is easily generalizable towards other viral pathogens and future pandemics. The fully automated workflow assembles virus genomes from patient samples, identifies existing lineages, and provides high-resolution insights into individual mutations. UnCoVar includes extensive quality control and automatically generates interactive visual reports. UnCoVar is implemented as a Snakemake workflow. The open-source code is available under a BSD 2-clause license at github.com/IKIM-Essen/uncovar.
Funder
Universitätsklinikum Essen
Publisher
Springer Science and Business Media LLC
Reference64 articles.
1. Jones JM, Manrique IM, Stone MS, Grebe E, Saa P, Germanio CD, Spencer BR, Notari E, Bravo M, Lanteri MC, et al. Estimates of SARS-CoV-2 seroprevalence and incidence of primary SARS-CoV-2 infections among blood donors, by COVID-19 Vaccination Status - United States, April 2021-September 2022. MMWR Morb Mortal Wkly Rep. 2023;72(22):601–5. 2. Bloom JD, Neher RA. Fitness effects of mutations to SARS-CoV-2 proteins. Virus Evol 2023, 9(2). 3. Harvey WT, Carabelli AM, Jackson B, Gupta RK, Thomson EC, Harrison EM, Ludden C, Reeve R, Rambaut A, Peacock SJ, et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat Reviews Microbiol 2021. 2021;19(7):7. 4. Kemp SA, Collier DA, Datir RP, Ferreira I, Gayed S, Jahun A, Hosmillo M, Rees-Spear C, Mlcochova P, Lumb IU, et al. Author correction: SARS-CoV-2 evolution during treatment of chronic infection. Nature. 2022;608(7922):E23. 5. Markov PV, Ghafari M, Beer M, Lythgoe K, Simmonds P, Stilianakis NI, Katzourakis A, Markov PV, Ghafari M, Beer M, et al. The evolution of SARS-CoV-2. Nat Reviews Microbiol 2023. 2023;21(6):6.
|
|