Statistical modeling of SARS-CoV-2 substitution processes: predicting the next variant-Reference-Cited by-同舟云学术

Statistical modeling of SARS-CoV-2 substitution processes: predicting the next variant

Published:2022-03-29 Issue:1 Volume:5 Page:
ISSN:2399-3642
Container-title:Communications Biology
language:en
Short-container-title:Commun Biol

Author:

Levinstein Hallak Keren,Rosset Saharon^ORCID

Abstract

AbstractWe build statistical models to describe the substitution process in the SARS-CoV-2 as a function of explanatory factors describing the sequence, its function, and more. These models serve two different purposes: first, to gain knowledge about the evolutionary biology of the virus; and second, to predict future mutations in the virus, in particular, non-synonymous amino acid substitutions creating new variants. We use tens of thousands of publicly available SARS-CoV-2 sequences and consider tens of thousands of candidate models. Through a careful validation process, we confirm that our chosen models are indeed able to predict new amino acid substitutions: candidates ranked high by our model are eight times more likely to occur than random amino acid changes. We also show that named variants were highly ranked by our models before their appearance, emphasizing the value of our models for identifying likely variants and potentially utilizing this knowledge in vaccine design and other aspects of the ongoing battle against COVID-19.

Publisher

Springer Science and Business Media LLC

Subject

General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,Medicine (miscellaneous)

Link

https://www.nature.com/articles/s42003-022-03198-y.pdf

Reference67 articles.

1. Shereen, M. A., Khan, S., Kazmi, A., Bashir, N. & Siddique, R. COVID-19 infection: origin, transmission, and characteristics of human coronaviruses. J. Adv. Res. 24, 91–98 (2020).

2. Wang, H., Pipes, L. & Nielsen, R. Synonymous mutations and the molecular evolution of SARS-CoV-2 origins. Virus Evol. 7, veaa098 (2021).

3. Graudenzi, A., Maspero, D., Angaroni, F., Piazza, R. & Ramazzotti, D. Mutational signatures and heterogeneous host response revealed via large-scale characterization of SARS-CoV-2 genomic diversity. Iscience 24, 102116 (2021).

4. Mourier, T. et al. Host-directed editing of the SARS-COV-2 genome. Biochem. Biophys. Res. Commun. 538, 35–39 (2021).

5. Zhang, Z., Shen, L. & Gu, X. Evolutionary dynamics of mers-cov: potential recombination, positive selection and transmission. Sci. Rep. 6, 1–10 (2016).

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research Progress on the Correlation Analysis between Reinfections of SARS-CoV-2 and Immunity;Advances in Clinical Medicine;2024

2. Modeling SARS-CoV-2 nucleotide mutations as a stochastic process;PLOS ONE;2023-04-28

3. A Computer Simulation of SARS-CoV-2 Mutation Spectra for Empirical Data Characterization and Analysis;Biomolecules;2022-12-28

4. Building a Resilient Scientific Network for COVID-19 and Beyond;mBio;2022-10-26