Abstract
AbstractThe phylumPreplasmiviricota(kingdomBamfordvirae, realmVaridnaviria) is a broad assemblage of diverse viruses with comparatively short double-stranded DNA genomes (<50 kbp) that produce icosahedral capsids built from double jelly-roll major capsid proteins. Preplasmiviricots infect hosts from all cellular domains, testifying to their ancient origin and, in particular, are associated with six of the seven supergroups of eukaryotes. Preplasmiviricots comprise four major groups of viruses, namely, polintons, polinton-like viruses (PLVs), virophages, and adenovirids. We employed protein structure modeling and analysis to show that protein-primed DNA polymerases (pPolBs) of polintons, virophages, and cytoplasmic linear plasmids encompass an N-terminal domain homologous to the terminal proteins (TPs) of prokaryotic PRD1-like tectivirids and eukaryotic adenovirids that are involved in protein-primed replication initiation, followed by a viral ovarian tumor-like cysteine deubiquitinylase (vOTU) domain. The vOTU domain is likely responsible for the cleavage of the TP from the large pPolB polypeptide and is inactivated in adenovirids, in which TP is a separate protein. Many PLVs and transpovirons encode a distinct derivative of polinton-like pPolB that retains the TP, vOTU and pPolB polymerization palm domains but lacks the exonuclease domain and instead contains a supefamily 1 helicase domain. Analysis of the presence/absence and inactivation of the vOTU domains, and replacement of pPolB with other DNA polymerases in eukaryotic preplasmiviricots enabled us to outline a complete scenario for their origin and evolution.SignificanceStructural modeling of protein domains using advanced artificial intelligence-based methods such as AlphaFold2 may lead to insights into evolutionary relationships among proteins that are unreachable by sequence analysis. We applied this approach to elucidate the evolutionary relationships of four major groups of eukaryotic viruses: polintons, polinton-like viruses (PLVs), virophages, and adenovirids. We identified previously uncharacterized protein domains predicted to be essential for virus genome replication. Analysis of the presence/absence and inactivation of these domains suggests a complete scenario for the origin and evolution of this major part of the eukaryotic virosphere.
Publisher
Cold Spring Harbor Laboratory