Abstract
Network alignment aims to uncover topologically similar regions in the protein-protein interaction (PPI) networks of two or more species, under the assumption that topologically similar regions perform similar functions. Although a plethora of network alignment algorithms and measures of topological similarity exist, there is currently no “gold standard” for evaluating how well either uncovers functionally similar regions. Here we propose a formal, mathematically and statistically rigorous method for evaluating the statistical significance of shared GO terms in a global, 1-to-1 alignment between two PPI networks. We use combinatorics to count exactly the number of possible network alignments in which k proteins share a particular GO term. Dividing this count by the number of all possible network alignments yields an explicit, exact p-value for a network alignment with respect to that GO term.
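The counting idea can be illustrated with a small sketch. It is not the paper's derivation, only a simplified model built on stated assumptions: an alignment is an injective map from the n1 proteins of the smaller network into the n2 proteins of the larger one, lam1 and lam2 proteins in the respective networks carry the GO term, and an aligned pair "shares" the term when both of its proteins carry it. All function and parameter names here are hypothetical. The closed-form count is checked against brute-force enumeration on a tiny instance, and the p-value is taken as the fraction of alignments with at least k sharing pairs.

```python
from fractions import Fraction
from itertools import permutations
from math import comb, perm


def count_exact(n1, n2, lam1, lam2, k):
    """Count injective alignments with exactly k pairs sharing the GO term.

    Assumes lam1 <= n1 <= n2 and lam2 <= n2 (simplified model, see lead-in).
    """
    if k < 0 or k > min(lam1, lam2):
        return 0
    return (
        comb(lam1, k)                # which annotated proteins of network 1 get matched
        * perm(lam2, k)              # their annotated partners in network 2, in order
        * perm(n2 - lam2, lam1 - k)  # other annotated proteins go to unannotated partners
        * perm(n2 - lam1, n1 - lam1) # unannotated proteins take any remaining partner
    )


def count_brute_force(n1, n2, lam1, lam2, k):
    """Same count by enumerating every injective alignment (tiny inputs only)."""
    total = 0
    for image in permutations(range(n2), n1):
        # Proteins 0..lam1-1 (network 1) and 0..lam2-1 (network 2) are annotated.
        shared = sum(1 for u, v in enumerate(image) if u < lam1 and v < lam2)
        if shared == k:
            total += 1
    return total


def p_value(n1, n2, lam1, lam2, k):
    """Exact probability that a uniformly random alignment has >= k sharing pairs."""
    favourable = sum(
        count_exact(n1, n2, lam1, lam2, j) for j in range(k, min(lam1, lam2) + 1)
    )
    return Fraction(favourable, perm(n2, n1))


if __name__ == "__main__":
    n1, n2, lam1, lam2 = 3, 4, 2, 2
    for k in range(0, 3):
        print(k, count_exact(n1, n2, lam1, lam2, k),
              count_brute_force(n1, n2, lam1, lam2, k))
    print(p_value(n1, n2, lam1, lam2, 2))
```

Under these assumptions the ratio reduces to a hypergeometric tail; real annotation data, where proteins carry many GO terms, may call for a more careful treatment than this sketch provides.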
Publisher
Cold Spring Harbor Laboratory