Author:
Li Qi,Button-Simons Katrina A,Sievert Mackenzie AC,Chahoud Elias,Foster Gabriel F,Meis Kaitlynn,Ferdig Michael T,Milenković Tijana
Abstract
AbstractBackgroundMalaria results in more than 550,000 deaths each year due to drug resistance in the most lethalPlasmodium(P.) speciesP. falciparum. A fullP. falciparumgenome was published in 2002, yet 44.6% of its genes have unknown functions. Improving functional annotation of genes is important for identifying drug targets and understanding the evolution of drug resistance.ResultsGenes function by interacting with one another. So, analyzing gene co-expression networks can enhance functional annotations and prioritize genes for wet lab validation. Earlier efforts to build gene co-expression networks inP. falciparumhave been limited to a single network inference method or gaining biological understanding for only a single gene and its interacting partners. Here, we explore multiple inference methods and aim to systematically predict functional annotations for allP. falciparumgenes. We evaluate each inferred network based on how well it predicts existing gene-Gene Ontology (GO) term annotations using network clustering and leave-one-out cross-validation. We assess overlaps of the different networks’ edges (gene co-expression relationships) as well as predicted functional knowledge. The networks’ edges are overall complementary: 47%-85% of all edges are unique to each network. In terms of accuracy of predicting gene functional annotations, all networks yield relatively high precision (as high as 87% for the network inferred using mutual information), but the highest recall reached is below 15%. All networks having low recall means that none of them capture a large amount of all existing gene-GO term annotations. In fact, their annotation predictions are highly complementary, with the largest pairwise overlap of only 27%. We provide ranked lists of inferred gene-gene interactions and predicted gene-GO term annotations for future use and wet lab validation by the malaria community.ConclusionsThe different networks seem to capture different aspects of theP. falciparumbiology in terms of both inferred interactions and predicted gene functional annotations. Thus, relying on a single network inference method should be avoided when possible.Supplementary dataAttached.Availability and implementationAll data and code are available athttps://nd.edu/~cone/pfalGCEN/.Contacttmilenko@nd.edu
Publisher
Cold Spring Harbor Laboratory