Choice of Target in the Genomes of Prototypic Strains to Recognize Subgenus of Coronaviruses
-
Published:2023-07-15
Issue:2
Volume:18
Page:267-281
-
ISSN:1994-6538
-
Container-title:Mathematical Biology and Bioinformatics
-
language:
-
Short-container-title:Math.Biol.Bioinf.
Author:
Chaley M.B.,Kutyrkin V.A.
Abstract
Targeted approach to recognition of coronavirus subgenus on the base of codon frequency distribution in the N-gene of nucleocapsid protein was proposed in the work. Deviation of codon frequency distribution in the N-gene of coronavirus genome analyzed from the same distributions for the 67 prototypic strains, which characterize the 23 subgenera in the four coronavirus genera, is calculated on the base of statistics in the approach proposed. The smallest value of such a deviation from certain prototypic strain points at subgenus to which this strain belongs. The approach proposed appeared to be effective and supports significance for recognizing coronavirus subgenus at least 99 %. Populations of the 38 and 7 codons providing for needed efficiency level were selected out of all codons of the genetic code in accordance with their frequency distribution. The codons from the populations outlined fix taxonomic structure of coronavirus subgenus.
Publisher
Institute of Mathematical Problems of Biology of RAS (IMPB RAS)
Subject
Applied Mathematics,Biomedical Engineering
Reference33 articles.
1. AUTOMATION AND MATHEMATICAL APPARATUS FOR THE ANALYSIS OF GENOMICS DATA
2. GISAID. https://gisaid.org (accessed 14.06.2023).
3. GenBank. https://www.ncbi.nlm.nih.gov/genbank (accessed 14.06.2023).
4. ENA. https://www.ebi.ac.uk/ena/browser/home (accessed 14.06.2023).
5. CNGBdb. https://db.cngb.org (accessed 14.06.2023).