Multiple Mutations Associated with Emergent Variants Can Be Detected as Low-Frequency Mutations in Early SARS-CoV-2 Pandemic Clinical Samples
Author:
Kimbrel JeffreyORCID, Moon Joseph, Avila-Herrera Aram, Martí Jose ManuelORCID, Thissen James, Mulakken Nisha, Sandholtz Sarah H., Ferrell Tyshawn, Daum ChrisORCID, Hall Sara, Segelke Brent, Arrildt Kathryn T.ORCID, Messenger SharonORCID, Wadford Debra A.ORCID, Jaing Crystal, Allen Jonathan E.ORCID, Borucki Monica K.ORCID
Abstract
Genetic analysis of intra-host viral populations provides unique insight into pre-emergent mutations that may contribute to the genotype of future variants. Clinical samples positive for SARS-CoV-2 collected in California during the first months of the pandemic were sequenced to define the dynamics of mutation emergence as the virus became established in the state. Deep sequencing of 90 nasopharyngeal samples showed that many mutations associated with the establishment of SARS-CoV-2 globally were present at varying frequencies in a majority of the samples, even those collected as the virus was first detected in the US. A subset of mutations that emerged months later in consensus sequences were detected as subconsensus members of intra-host populations. Spike mutations P681H, H655Y, and V1104L were detected prior to emergence in variant genotypes, mutations were detected at multiple positions within the furin cleavage site, and pre-emergent mutations were identified in the nucleocapsid and the envelope genes. Because many of the samples had a very high depth of coverage, a bioinformatics pipeline, “Mappgene”, was established that uses both iVar and LoFreq variant calling to enable identification of very low-frequency variants. This enabled detection of a spike protein deletion present in many samples at low frequency and associated with a variant of concern.
Funder
Department of Energy, Office of Science
Subject
Virology,Infectious Diseases
Reference47 articles.
1. Nextstrain: Real-time tracking of pathogen evolution;Hadfield;Bioinformatics,2018 2. Hodcroft, E.B., Domman, D.B., Snyder, D.J., Oguntuyo, K.Y., Van Diest, M., Densmore, K.H., Schwalm, K.C., Femling, J., Carroll, J.L., and Scott, R.S. (2021). Emergence in late 2020 of multiple lineages of SARS-CoV-2 spike protein variants affecting amino acid position 677. medRxiv. 3. Latif, A.A., Mullen, J.L., Alkuzweny, M., Tsueng, G., Cano, M., Haag, E., Zhou, J., Zeller, M., Hufbauer, E., and Matteson, N. (2021, November 23). 4 Lineage Report, Outbreak.Info. Available online: https://outbreak.info/situation-reports?pango=AY.4.2. 4. Domingo, E., and Perales, C. (2019). Viral quasispecies. PLoS Genet., 15. 5. Borucki, M.K., Allen, J.E., Chen-Harris, H., Zemla, A., Vanier, G., Mabery, S., Torres, C., Hullinger, P., and Slezak, T. (2013). The role of viral population diversity in adaptation of bovine coronavirus to new host environments. PLOS ONE, 8.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|