Affiliation:
1. National Cancer Institute
2. Arizona State University
3. University of Cape Town
Abstract
The initial objective of this study was to shed light on the evolution of small DNA tumor viruses by analyzing de novo assemblies of publicly available deep sequencing datasets. The survey generated a searchable database of contig snapshots representing more than 100,000 Sequence Read Archive records. Using modern structure-aware search tools, we iteratively broadened the search to include an increasingly wide range of other virus families. The analysis revealed a surprisingly diverse range of chimeras involving different virus groups. In some instances, genes resembling known DNA-replication modules or known virion protein operons were paired with unrecognizable sequences that structural predictions suggest may represent previously unknown replicases and novel virion architectures. Discrete clades of an emerging group called adintoviruses were discovered in datasets representing humans and other primates. As a proof of concept, we show that the contig database is also useful for discovering RNA viruses and candidate archaeal phages. The ancillary searches revealed additional examples of chimerization between different virus groups. The observations support a gene-centric taxonomic framework that should be useful for future virus-hunting efforts.
Publisher
eLife Sciences Publications, Ltd
Cited by
1 article.