Abstract
Abstract
Background
Hepatitis E virus (HEV) is small (27–34 nm diameter) non-enveloped with positive sense ssRNA genome. Microsatellites or simple sequence repeats (SSR) are short tandem repeat sequences present across coding and non-coding regions of both prokaryotes and eukaryotes. They are involved with genome function and evolution at multiple levels.
Results
The complete genome sequences of 22 HEV genomes of the family Hepeviridae and genus Orthohepevirus (21 species) and Piscihepevirus (1 species) were extracted from NCBI database (http://www.ncbi.nlm.nih.gov/). The extraction of microsatellites was done using Imperfect Microsatellite Extractor (IMEx) in ‘Advance-Mode’. The average genome size of the studied HEV genomes was 7003nt and it ranged from 6649nt (HEV11) to 7310nt (HEV22). The average GC content of the genomes was ~ 55%. A total of 519 SSRs and 21 cSSRS were extracted from the HEV genomes with an average incidence of 24 per genome ranging from 14 (HEV13) to 34 (HEV19). The cSSR incidence ranged from 0 (eight species) to 4 (HEV19). The genomes with no cSSR incidence had an SSR incidence range from 14 to 28. There were just four hexa-nucleotide repeat motifs and 5 penta-nucleotide repeat motifs observed. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were “C”, “GT/TG”, and “GAC/CTG” respectively. The studied genomes had a minimum of ~ 90% incident SSRs present in the coding regions. Viruses with same or similar hosts are placed together on the phylogenetic tree implicating viral host being one of the driving forces for evolution. Conclusions
Host range in viruses is being decided by multiple factors aided by the unique genome SSR signature and genomes of varied compositions need to be analyzed to forge a widely acceptable rule for predicting the same.
Publisher
Springer Science and Business Media LLC
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献