Abstract
The unprecedented pace of SARS-CoV-2 genome sequencing provides unique information about the genetic changes in a single pathogen during an ongoing pandemic. By analyzing close to 200,000 genomes, we show that the patterns of SARS-CoV-2 mutations along its genome are closely correlated with the structural and functional features of the encoded proteins. Requirements of foldability of the proteins’ 3D structures and the conservation of their key functional regions, such as protein-protein interaction interfaces, are the dominant factors driving evolutionary selection in protein-coding genes. At the same time, avoidance of host immunity leads to an abundance of mutations in other regions, resulting in high variability of the missense mutation rate along the genome. “Unexplained” peaks and valleys in the mutation rate provide hints about the function of as yet uncharacterized genomic regions and the specific protein structural and functional features they code for. Some of these observations have immediate practical implications for the selection of target regions for PCR-based COVID-19 tests and for evaluating the risk of mutations in epitopes targeted by specific antibodies and vaccine design strategies.
Funder
National Institute of General Medical Sciences
National Institute of Allergy and Infectious Diseases
Publisher
Public Library of Science (PLoS)
Subject
Computational Theory and Mathematics; Cellular and Molecular Neuroscience; Genetics; Molecular Biology; Ecology; Modelling and Simulation; Ecology, Evolution, Behavior and Systematics
Cited by
34 articles.