Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline-Reference-Cited by-同舟云学术

Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline

Published:2022-09-30 Issue:3 Volume:20 Page:e35
ISSN:2234-0742
Container-title:Genomics & Informatics
language:en
Short-container-title:Genomics Inform

Author:

Umang ^ORCID,Bharti P. K.^ORCID,Husai Akhtar^ORCID

Abstract

Microsatellites or simple sequence repeats are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. These are found widely distributed in the whole genome of prokaryotes, eukaryotes, bacteria, and viruses and are used as molecular markers in studying DNA variations, gene regulation, genetic diversity and evolutionary studies, etc. However, in vitro microsatellite identification proves to be time-consuming and expensive. Therefore, the present research has been focused on using an in-house built java pipeline to identify, analyse, design primers and find related statistics of perfect and compound microsatellites in the seven complete genome sequences of coronavirus, including the genome of coronavirus disease 2019, where the host is Homo sapiens. Based on search criteria among seven genomic sequences, it was revealed that the total number of perfect simple sequence repeats (SSRs) found to be in the range of 76 to 118 and compound SSRs from 01 to10, thus reflecting the low conversion of perfect simple sequence to compound repeats. Furthermore, the incidence of SSRs was insignificant but positively correlated with genome size (R2 = 0.45, p > 0.05), with simple sequence repeats relative abundance (R2 = 0.18, p > 0.05) and relative density (R2 = 0.23, p > 0.05). Dinucleotide repeats were the most abundant in the coding region of the genome, followed by tri, mono, and tetra. This comparative study would help us understand the evolutionary relationship, genetic diversity, and hypervariability in minimal time and cost.

Publisher

Korea Genome Organization

Subject

Health Informatics,Genetics,Ecology, Evolution, Behavior and Systematics

Link

http://genominfo.org/upload/pdf/gi-20033.pdf

Reference49 articles.

1. History and Recent Advances in Coronavirus Discovery

2. Coronavirus Infections—More Than Just the Common Cold

3. Prevalence and genetic diversity analysis of human coronaviruses among cross-border children

4. Coevolution between simple sequence repeats (SSRs) and virus genome size

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. In silico analysis on frequency and distribution of microsatellites in genes associated with spinal cord astrocytoma;Human Gene;2024-09