Abstract
Data cataloguing viral diversity on Earth have been fragmented across sources, disciplines, formats, and various degrees of open collation, posing challenges for research on macroecology, evolution, and public health. Here, we solve this problem by establishing a dynamically-maintained database of vertebrate-virus associations, called The Global Virome in One Network (VIRION). The VIRION database has been assembled through both reconciliation of static datasets and integration of dynamically-updated databases. These data sources are all harmonized against one taxonomic backbone, including metadata on host and virus taxonomic validity and higher classification; additional metadata on sampling methodology and evidence strength are also available in a harmonized format. In total, the VIRION database is the largest open-source, open-access database of its kind, with roughly half a million unique records that include 9,521 resolved virus “species” (of which 1,661 are ICTV ratified), 3,692 resolved vertebrate host species, and 23,147 unique interactions between taxonomically-valid organisms. Together, these data cover roughly a quarter of mammal diversity, a tenth of bird diversity, and ~6% of the estimated total diversity of vertebrates, and a much larger proportion of their virome than any previous database. We show how these data can be used to test hypotheses about microbiology, ecology, and evolution, and make suggestions for best practices that address the unique mix of evidence that coexists in these data.
Publisher
Cold Spring Harbor Laboratory
Cited by
4 articles.