SANA: cross-species prediction of Gene Ontology GO annotations via topological network alignment-Reference-Cited by-同舟云学术

SANA: cross-species prediction of Gene Ontology GO annotations via topological network alignment

Published:2022-07-20 Issue:1 Volume:8 Page:
ISSN:2056-7189
Container-title:npj Systems Biology and Applications
language:en
Short-container-title:npj Syst Biol Appl

Author:

Wang Siyue,Atkinson Giles R. S.,Hayes Wayne B.^ORCID

Abstract

AbstractTopological network alignment aims to align two networks node-wise in order to maximize the observed common connection (edge) topology between them. The topological alignment of two protein–protein interaction (PPI) networks should thus expose protein pairs with similar interaction partners allowing, for example, the prediction of common Gene Ontology (GO) terms. Unfortunately, no network alignment algorithm based on topology alone has been able to achieve this aim, though those that include sequence similarity have seen some success. We argue that this failure of topology alone is due to the sparsity and incompleteness of the PPI network data of almost all species, which provides the network topology with a small signal-to-noise ratio that is effectively swamped when sequence information is added to the mix. Here we show that the weak signal can be detected using multiple stochastic samples of “good” topological network alignments, which allows us to observe regions of the two networks that are robustly aligned across multiple samples. The resulting network alignment frequency (NAF) strongly correlates with GO-based Resnik semantic similarity and enables the first successful cross-species predictions of GO terms based on topology-only network alignments. Our best predictions have an AUPR of about 0.4, which is competitive with state-of-the-art algorithms, even when there is no observable sequence similarity and no known homology relationship. While our results provide only a “proof of concept” on existing network data, we hypothesize that predicting GO terms from topology-only network alignments will become increasingly practical as the volume and quality of PPI network data increase.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Drug Discovery,General Biochemistry, Genetics and Molecular Biology,Modeling and Simulation

Link

https://www.nature.com/articles/s41540-022-00232-x.pdf

Reference111 articles.

1. Furuse, M., Fujita, K., Hiiragi, T., Fujimoto, K. & Tsukita, S. Claudin-1 and -2: novel integral membrane proteins localizing at tight junctions with no sequence similarity to occludin. J. Cell Biol. 141, 1539–1550 (1998).

2. Fisher, S., Grice, E. A., Vinton, R. M., Bessling, S. L. & McCallion, A. S. Conservation of ret regulatory function from human to zebrafish without sequence similarity. Science 312, 276–279 (2006).

3. Schlicker, A., Domingues, F. S., Rahnenführer, J. & Lengauer, T. A new measure for functional similarity of gene products based on gene ontology. BMC Bioinformatics 7, 302 (2006).

4. Kabsch, W. & Sander, C. On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. Proc. Natl Acad. Sci. USA 81, 1075–1078 (1984).

5. Morrone, A. et al. The denatured state dictates the topology of two proteins with almost identical sequence but different native structure and function. J. Biol. Chem. 286, 3863–3872 (2011).

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. New GO-based measures in multiple network alignment;Bioinformatics;2024-07-31

2. Exact p-values for global network alignments via combinatorial analysis of shared GO terms;Journal of Mathematical Biology;2024-03-29

3. Boosting-based ensemble of global network aligners for PPI network alignment;Expert Systems with Applications;2023-11

4. CACO: A Core-Attachment Method With Cross-Species Functional Ortholog Information to Detect Human Protein Complexes;IEEE Journal of Biomedical and Health Informatics;2023-09

5. Multi-SANA: Comparing Measures of Topological Similarity for Multiple Network Alignment;IEEE Transactions on Evolutionary Computation;2022-10