Abstract
AbstractHepatocystisare apicomplexan parasites nested within thePlasmodiumgenus that infect primates and other vertebrates, yet few isolates have been genetically characterized. Using taxonomic classification and mapping characteristics, we searched forHepatocystisinfections within publicly available, blood-derived low coverage whole genome sequence (lcWGS) data from 326 wild non-human primates (NHPs) in 17 genera. We identified 30Hepatocystisinfections inChlorocebusandPapiosamples collected from locations in west, east, and south Africa.Hepatocystis cytbsequences fromPapiohosts phylogenetically clustered with previously reported isolates from multiple NHP taxa whereas sequences fromChlorocebushosts form a separate cluster, suggesting they represent a new host-specific clade ofHepatocystis.Additionally, there was no geographic clustering ofHepatocystisisolates suggesting both clades ofHepatocystiscould be found in NHPs throughout sub-Saharan Africa. Across the genome, windows of high SNP density revealed candidate hypervariable loci includingHepatocystis-specific gene families possibly involved in immune evasion and genes that may be involved in adaptation to their insect vector and hepatocyte invasion. Overall, this work demonstrates how lcWGS data from wild NHPs can be leveraged to study the evolution of apicomplexan parasites and potentially test for association between host genetic variation and parasite infection.Author SummaryNon-human primates are hosts to many species ofPlasmodium, the parasites that cause malaria, and a closely related group of parasites calledHepatocystis. However, due to restrictions and challenges of sampling from wild populations, we lack a complete understanding of the breadth of diversity and distribution of these parasites. Here, we provide a framework for testing already-sampled populations for parasite infections using whole genome sequences derived from whole blood samples from the host. Following taxonomic classification of these sequences using a database of reference genomes, we mapped reads to candidate parasite genomes and used an unsupervised clustering algorithm including coverage metrics to further validate infection inferences. Through this approach, we identified 30Hepatocystisinfections from two genetically distinct clades ofHepatocystisin African non-human primates and described genes that may be under immune selection in each. Most importantly, the framework here can be applied to additional sequencing datasets from non-human primates and other vertebrate hosts as well as datasets from invertebrate vectors. Therefore, this approach could greatly improve our understanding of where these parasites are found, their host-specificity, and their evolutionary history. This framework may also be adapted to study evolution in other host-pathogen groups.
Publisher
Cold Spring Harbor Laboratory