Author:
Swift S.,Payne A.,Pavlidis S.
Abstract
SummaryBackground: Pathway based microarray analysis is an effort to integrate microarray and pathway data in a holistic analytical approach, looking for coordinated changes in the expression of sets of genes forming pathways. However, it has been observed that the results produced are often cryptic, with cases of closely related genes in a pathway showing quite variable, even opposing expression.Objectives: We propose a methodology to identify the state of activation of individual pathways, based on our hypothesis that gene members of many pathways or modules exhibit differential expression that results from their contribution to any combination of all their constituent pathways. Therefore, the observed expression of such a gene does not necessarily imply the activation state of a given pathway where its product participates, but reflects the net expression resulting from its participation in all its constituent pathways.Methods: Firstly, in an effort to validate the hypothesis, we split the genes into two groups; single and multi-membership. We then determined and compared the proportion of differentially expressed genes in each group, for each experiment. In addition, we estimated the cumulative binomial probability of observing as many or more expressed genes in each group, in each experiment, simply by chance. Second, we propose a hill climbing methodology, aiming to maximise the agreement of gene expression per module.Results: We detected more frequent expression of multi-membership genes and significantly lower probabilities of observing such a high proportion of differentially expressed multi-membership genes, as the one present in the dataset. The algorithm was able to correctly identify the state of activation of the KEGG glycolysis and gluconeogenesis modules, using a number of Saccharomyces cerevisiae datasets. We show that the result is equivalent to the best solution found following exhaustive search.Conclusions: The proposed method takes into account the multi-membership nature of genes and our knowledge of the competitive nature of our exemplar modules, revealing the state of activity of a pathway.
Subject
Health Information Management,Advanced and Specialized Nursing,Health Informatics