Abstract
The COVID-19 pandemic has been characterised by sequential variant-specific waves shaped by viral, individual human and population factors. SARS-CoV-2 variants are defined by their unique combinations of mutations, and there has been a clear adaptation to more efficient human infection since the emergence of this new human coronavirus in late 2019. Here, we use machine learning models to identify shared signatures, i.e., common underlying mutational processes, and link these to the subset of mutations that define the variants of concern (VOCs). First, we used regression models on the global SARS-CoV-2 genomes and associated metadata to determine how viral properties and public health measures have influenced the magnitude of waves, as measured by the number of infection cases, in different geographic locations. This analysis showed that, as expected, both public health measures and virus properties were associated with the waves of regional reported SARS-CoV-2 infection numbers, and that this association varied geographically. We attribute this variation to intrinsic regional differences such as vaccine coverage, testing and sequencing capacity, and the effectiveness of government stringency. To assess underlying evolutionary change, we applied non-negative matrix factorisation to the SARS-CoV-2 genomes and observed three distinct mutational signatures, each unique in its substitution pattern and exposures. Signatures 1, 2 and 3 were biased towards C→T, T→C/A→G and G→T point mutations, respectively. We hypothesise that these mutational signatures reflect the action of the host antiviral molecules APOBEC, ADAR and ROS, respectively. We observe a shift during the pandemic in relative mutational signature activity, from predominantly Signature 1 changes to an increasingly high proportion of changes consistent with Signature 2. This could represent changes in how the virus and the host immune response interact, and indicates how SARS-CoV-2 may continue to generate variation in the future.
Linkage of the detected mutational signatures to the VOC-defining amino acid substitutions indicates that the majority of SARS-CoV-2’s evolutionary capacity is likely to be associated with the action of host antiviral molecules rather than virus replication errors.
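To illustrate the kind of decomposition the abstract describes, the following is a minimal sketch of non-negative matrix factorisation applied to a toy matrix of substitution counts. It is not the authors' pipeline: the substitution classes, the `TRUE_SIGNATURES` and `TRUE_EXPOSURES` values, and the helper names (`nmf`, `normalise`, `relative_error`) are invented for illustration, and the solver shown is the generic Lee-Seung multiplicative update rule, which the abstract does not specify.

```python
import random

def matmul(A, B):
    """Plain-Python product of two list-of-lists matrices."""
    Bt = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def nmf(V, rank, iters=1000, seed=0, eps=1e-12):
    """Factorise non-negative V (classes x samples) as W @ H, with
    W (classes x rank) the signatures and H (rank x samples) the exposures.
    Uses Lee-Seung multiplicative updates for the squared-error objective."""
    rng = random.Random(seed)
    n, m = len(V), len(V[0])
    W = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    H = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iters):
        # H <- H * (W^T V) / (W^T W H)
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(Wt, matmul(W, H))
        H = [[H[k][j] * num[k][j] / (den[k][j] + eps) for j in range(m)]
             for k in range(rank)]
        # W <- W * (V H^T) / (W H H^T)
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(matmul(W, H), Ht)
        W = [[W[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n)]
    return W, H

def normalise(W, H):
    """Rescale so each signature (column of W) sums to 1; H absorbs the scale."""
    totals = [sum(row[k] for row in W) for k in range(len(H))]
    Wn = [[row[k] / totals[k] for k in range(len(H))] for row in W]
    Hn = [[H[k][j] * totals[k] for j in range(len(H[0]))] for k in range(len(H))]
    return Wn, Hn

def relative_error(V, W, H):
    """Frobenius norm of V - W @ H, relative to the norm of V."""
    WH = matmul(W, H)
    num = sum((v - r) ** 2 for rv, rr in zip(V, WH) for v, r in zip(rv, rr))
    den = sum(v ** 2 for rv in V for v in rv)
    return (num / den) ** 0.5

# Hypothetical toy data (NOT from the paper): three signatures biased
# towards C>T, T>C/A>G and G>T, with columns summing to 1.
CLASSES = ["C>T", "T>C", "A>G", "G>T", "G>A", "A>C"]
TRUE_SIGNATURES = [
    [0.80, 0.05, 0.05],
    [0.05, 0.45, 0.05],
    [0.05, 0.40, 0.05],
    [0.04, 0.04, 0.75],
    [0.03, 0.03, 0.05],
    [0.03, 0.03, 0.05],
]
# Invented exposures over eight time bins: signature 1 declines, signature 2 rises.
TRUE_EXPOSURES = [
    [90, 80, 70, 60, 50, 40, 30, 20],
    [10, 20, 30, 40, 50, 60, 70, 80],
    [20, 5, 25, 10, 30, 15, 35, 12],
]

V = matmul(TRUE_SIGNATURES, TRUE_EXPOSURES)   # classes x time bins
W, H = nmf(V, rank=3)
Wn, Hn = normalise(W, H)

for k in range(3):
    # Report the dominant substitution class of each recovered signature.
    top = max(range(len(CLASSES)), key=lambda i: Wn[i][k])
    print(f"signature {k}: strongest class {CLASSES[top]}")
print("relative reconstruction error:", relative_error(V, Wn, Hn))
```

In this sketch the columns of `Wn` play the role of substitution-pattern signatures and the rows of `Hn` the role of exposures over time. NMF recovers signatures only up to permutation and scaling, so the order of the recovered components need not match the order in which the toy signatures were defined, and a real analysis would also need a principled way to choose the number of signatures.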
Funder
Medical Research Council
Wellcome Trust
Department for International Development, UK Government
Engineering and Physical Sciences Research Council
HORIZON EUROPE Innovative Europe
Publisher
Public Library of Science (PLoS)