Affiliation:
1. Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México Mexico City México
2. Departamento de Ciencias Biológicas Animales, Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile Santiago Chile
Abstract
Billions of genomic sequences and records of species occurrence are available in public repositories (e.g. National Center for Biotechnology Information, NCBI and the Global Biodiversity Information Facility, GBIF). By implementing analytical tools from different scientific disciplines, data mining these databases can aid in the global surveillance of zoonotic pathogens that circulate among wildlife. We illustrate this by investigating the Hantavirus–rodent system in the Americas, i.e. New World Hantaviruses (NWH). First, we considered the circulation of pathogenic NWH among Cricetidae rodents, by inferring the phylogenetic links among 277 genomic samples of the S segment (N protein) of NWH found in 55 species. Second, we used machine learning to assess the impact of land use on the probability of presence of the rodent species linked with reservoirs of pathogenic Hantaviruses. Our results show that hosts are widely present across the Americas. Some hosts are present in the primary forest and agricultural land, but not in the secondary forest, whereas other hosts are present in secondary forest and agricultural land. The diversity of host species allows Hantavirus to circulate in a wide spectrum of habitats, in particular rural rather than urban. We highlight that public repositories of genomic data and species occurrence are very useful resources for monitoring potential enzootic transmission and spillover of zoonotic viruses in relation with the changes that humans produce in the biosphere.