Exploratory Data Analysis of Genomic Sequence of Variants of SARS-CoV-2 Reveals Sequence Divergence and Mutational Localization-Reference-Cited by-同舟云学术

Exploratory Data Analysis of Genomic Sequence of Variants of SARS-CoV-2 Reveals Sequence Divergence and Mutational Localization

Published:2022-01 Issue: Volume:16 Page:117793222211262
ISSN:1177-9322
Container-title:Bioinformatics and Biology Insights
language:en
Short-container-title:Bioinform Biol Insights

Author:

Sangeet Satyam¹,Khan Arshad¹

Affiliation:

1. Department of Biological Science and Engineering, Maulana Azad National Institute of Technology, Bhopal, India

Abstract

Whole genome sequencing has rapidly progressed in recent years, with sequencing the SARS-CoV-2 genomes, making it a more reliable clinical tool for public health surveillance. This development has resulted in the production of a large amount of genomic data used for various types of genomic exploration. However, without a proper standard protocol, the usage of genomic data for analyzing various biological phenomena, such as mutation and evolution, may result in a propagating risk of using an unvalidated data set. This process could lead to irregular data being generated along with a high risk of altered analysis. Thus, the current study lays out the foundation for a preprocess pipeline using data analysis to analyze the genomic data set for its accuracy. We have used the recent example of SARS-CoV-2 to demonstrate the process overflow that can be utilized for various kinds of biological exploration such as understanding mutational events, evolutionary divergence, and speciation. Our analysis reveals a significant amount of sequence divergence in the gamma variant as compared with the reference genome thereby making the variant less infective and deadly. Moreover, we found regions in the genomic sequence that is more prone to mutational localization thereby altering the structural integrity of the virus resulting in a more reliable molecular viral mechanism. We believe that the current work will help for an initial check of the genomic data followed by the biological assessment of the process overflow which will be beneficial for the variant analysis and mutational uprising.

Publisher

SAGE Publications

Subject

Applied Mathematics,Computational Mathematics,Computer Science Applications,Molecular Biology,Biochemistry

Link

http://journals.sagepub.com/doi/pdf/10.1177/11779322221126294

Reference6 articles.

1. COVID-19 infection: Emergence, transmission, and characteristics of human coronaviruses

2. COVID-19: Emergence, Spread, Possible Treatments, and Global Burden

3. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evolution of Sequence and Structure of SARS-CoV-2 Spike Protein: A Dynamic Perspective;ACS Omega;2023-06-21