Author:
Belinsky Alexandra,Kouzaev Guennadi A.
Abstract
AbstractThis work aims to study the virus RNAs using a novel accelerated algorithm on exploring any-length genomic fragments in sequences using Hamming distance between the binary-expressed RNA symbols and explored pattern characters. The found repetitive genomic sub-sequences of different lengths were placed on one plot as genomic trajectories (walks) to increase the effectiveness of geometrical multi-scale genomic studies. Primary attention was paid to the building and analysis of the atg-triplet walks composing the schemes or skeletons of the viral RNAs. The 1-D distributions of these codon-starting atg-triplets were built with the single-symbol walks for full-scale analyses. The visual examination was followed by calculating statistical parameters of genomic sequences, including the estimation of geometry deviation and fractal properties of inter-atg distances. This approach was applied to the SARS CoV-2, MERS CoV, Dengue and Ebola viruses, whose complete genomic sequences are taken from GenBank and GISAID databases. Relative stability of these distributions for SARS CoV-2 and MERS CoV viruses was found, unlike the Dengue and Ebola distributions that showed increased deviation of their geometrical and fractal characteristics of atg-distributions.
Publisher
Cold Spring Harbor Laboratory
Reference75 articles.
1. G. Meister , RNA Biology: An Introduction. Weinheim, Wiley-VCH, 2011.
2. RNA sequencing and analysis;Cold Spring Harb Protoc,2015
3. An Expanding Universe of Noncoding RNAs
4. C. Nello and M.W. Hahn , Introduction to Computational Genomics: A Case Studies Approach. Cambridge, University Press, 2012. https://doi.org/10.1017/CBO9780511808982
5. H.K. Kwan and S.B. Arniker , Numerical representation of DNA sequences. Proc. 2009 IEEE International Conference on Electro/Information Technology, 2009, pp. 307–310. http://dx.doi.org/10.1109/EIT.2009.5189632
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献