Prediction of Recurrent Mutations in SARS-CoV-2 Using Artificial Neural Networks

Published:2022-11-24 Issue:23 Volume:23 Page:14683
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Saldivar-Espinoza Bryan, Macip Guillem, Garcia-Segura Pol, Mestres-Truyol Júlia, Puigbò Pere, Cereto-Massagué Adrià, Pujadas Gerard, Garcia-Vallve Santiago

Abstract

Predicting SARS-CoV-2 mutations is difficult, but predicting recurrent mutations driven by the host, such as those caused by host deaminases, is feasible. We used machine learning to predict which positions from the SARS-CoV-2 genome will hold a recurrent mutation and which mutations will be the most recurrent. We used data from April 2021 that we separated into three sets: a training set, a validation set, and an independent test set. For the test set, we obtained a specificity value of 0.69, a sensitivity value of 0.79, and an Area Under the Curve (AUC) of 0.8, showing that the prediction of recurrent SARS-CoV-2 mutations is feasible. Subsequently, we compared our predictions with updated data from January 2022, showing that some of the false positives in our prediction model become true positives later on. The most important variables detected by the model’s Shapley Additive exPlanation (SHAP) are the nucleotide that mutates and RNA reactivity. This is consistent with the SARS-CoV-2 mutational bias pattern and the preference of some host deaminases for specific sequences and RNA secondary structures. We extend our investigation by analyzing the mutations from the variants of concern Alpha, Beta, Delta, Gamma, and Omicron. Finally, we analyzed amino acid changes by looking at the predicted recurrent mutations in the M-pro and spike proteins.

Funder

European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie

Universitat Rovira i Virgili

Publisher

MDPI AG

Subject

Inorganic Chemistry, Organic Chemistry, Physical and Theoretical Chemistry, Computer Science Applications, Spectroscopy, Molecular Biology, General Medicine, Catalysis

Link

https://www.mdpi.com/1422-0067/23/23/14683/pdf

References: 83 articles.

1. A New Coronavirus Associated with Human Respiratory Disease in China;Nature,2020

2. The Architecture of SARS-CoV-2 Transcriptome;Cell,2020

3. Emerging Coronaviruses: Genome Structure, Replication, and Pathogenesis;J. Med. Virol.,2020

4. Wang, R., Hozumi, Y., Zheng, Y.-H., Yin, C., and Wei, G.-W. (2020). Host Immune Response Driving SARS-CoV-2 Evolution. Viruses, 12.

5. Are RNA Viruses Candidate Agents for the Next Global Pandemic? A Review;ILAR J.,2017

Cited by 5 articles.

1. Properties and Mechanisms of Deletions, Insertions, and Substitutions in the Evolutionary History of SARS-CoV-2;International Journal of Molecular Sciences;2024-03-26

2. Broad Epitope Coverage of Therapeutic Multi-Antibody Combinations Targeting SARS-CoV-2 Boosts In Vivo Protection and Neutralization Potency to Corner an Immune-Evading Virus;Biomedicines;2024-03-13

3. Computational methods for studying relationship between nutritional status and respiratory viral diseases: a systematic review;Artificial Intelligence Review;2024-01

4. The Mutational Landscape of SARS-CoV-2;International Journal of Molecular Sciences;2023-05-22

5. A Simple Epidemiologic Model for Predicting Impaired Neutralization of New SARS-CoV-2 Variants;Vaccines;2023-01-05