Abstract
Background
Escherichia phage CMSTMSU which isolated from shrimp farm effluent water was sequenced the whole genome and find out the gene ontology, predicted protein statistics, enzyme classes and predicting the secondary structure analysis of protein etc. Bioinformatics tools like CANU algorithm for interpretation of incomplete assemblies, Blast2GO tools to study the Gene Ontology, Mauve system for constructing multiple genome alignments related to evolutionary events Rapid annotation using subsystems technology (RAST) used to annotate protein sequences and Iterative Threading Assembly Refinement (I-TASSER) used protein secondary structure prediction were carried out.
Results
According to sequencing results, the genome's average read length was found to be 3.4 Kb, with a total of 88676 reads observed. A single contig with 35.6% GC content and a genome size of 386.4 kb was produced by the CANU assembly. 767 genes and 700 proteins were annotated, according to the BLAST2GO mapping, and BLAST hits demonstrated sequence similarity of at least 70% to the known protein sequences in the nr database. A study of the KEGG pathway revealed that 18 sequences were involved in the metabolism of thiamine, 26 sequences in the metabolism of pyrdimine, and 37 sequences in the metabolism of purines. Escherichia phage CMSTMSU was found to be closely related to Escherichia PBECO 4, Escherichia Phage 121Q, and Escherichia phage vB Eco slurp01, according to MAUV alignment. Escherichia phage CMSTMSU protein was found to have a close relationship with other phages, such as Salmonella phage munch, Klebsiella phage vB KleM RaK2, and Cronobacter phage CsaM GAP32, according to secondary structure prediction.
Conclusions
Compared to other reference phages in our study, it is noteworthy that the newly isolated phages' large genome sequences have the potential to spark curiosity from a biological standpoint.