Node similarity-based graph convolution for link prediction in biological networks-Reference-Cited by-同舟云学术

Node similarity-based graph convolution for link prediction in biological networks

Published:2021-06-21 Issue:23 Volume:37 Page:4501-4508
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Coşkun Mustafa¹²^ORCID,Koyutürk Mehmet³⁴

Affiliation:

1. Department of Computer Engineering, Abdullah Gül University, Kayseri, Turkey

2. Hakkari University, Kayseri 38080, Turkey

3. Department of Computer and Data Sciences, Case Western Reserve University, Cleveland, OH 44106, USA

4. Center for Proteomics and Bioinformatics, Case Western Reserve University, Cleveland, OH 44106, USA

Abstract

ABSTRACT Background Link prediction is an important and well-studied problem in network biology. Recently, graph representation learning methods, including Graph Convolutional Network (GCN)-based node embedding have drawn increasing attention in link prediction. Motivation An important component of GCN-based network embedding is the convolution matrix, which is used to propagate features across the network. Existing algorithms use the degree-normalized adjacency matrix for this purpose, as this matrix is closely related to the graph Laplacian, capturing the spectral properties of the network. In parallel, it has been shown that GCNs with a single layer can generate more robust embeddings by reducing the number of parameters. Laplacian-based convolution is not well suited to single-layered GCNs, as it limits the propagation of information to immediate neighbors of a node. Results Capitalizing on the rich literature on unsupervised link prediction, we propose using node similarity-based convolution matrices in GCNs to compute node embeddings for link prediction. We consider eight representative node-similarity measures (Common Neighbors, Jaccard Index, Adamic-Adar, Resource Allocation, Hub- Depressed Index, Hub-Promoted Index, Sorenson Index and Salton Index) for this purpose. We systematically compare the performance of the resulting algorithms against GCNs that use the degree-normalized adjacency matrix for convolution, as well as other link prediction algorithms. In our experiments, we use three-link prediction tasks involving biomedical networks: drug–disease association prediction, drug–drug interaction prediction and protein–protein interaction prediction. Our results show that node similarity-based convolution matrices significantly improve the link prediction performance of GCN-based embeddings. Conclusion As sophisticated machine-learning frameworks are increasingly employed in biological applications, historically well-established methods can be useful in making a head-start. Availability and implementation Our method, SiGraC, is implemented as a Python library and is freely available at https://github.com/mustafaCoskunAgu/SiGraC.

Funder

US National Institutes of Health

National Cancer Institute

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

http://academic.oup.com/bioinformatics/advance-article-pdf/doi/10.1093/bioinformatics/btab464/39300687/btab464.pdf

Reference38 articles.

1. Friends and neighbors on the web;Adamic;Soc. Netw,2003

2. The unified medical language system (UMLS): integrating biomedical terminology;Bodenreider;Nucleic Acids Res,2004

3. New directions for diffusion-based network prediction of protein function: incorporating pathways with confidence;Cao;Bioinformatics,2014

4. Compact integration of multi-network topology for functional analysis of genes;Cho;Cell Syst,2016

5. Link Prediction in Large Networks by Comparing the Global View of Nodes in the Network

Cited by 38 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SFGCN: Synergetic fusion-based graph convolutional networks approach for link prediction in social networks;Information Fusion;2025-02

2. Drug repositioning based on residual attention network and free multiscale adversarial training;BMC Bioinformatics;2024-08-08

3. Subgraph-Aware Graph Kernel Neural Network for Link Prediction in Biological Networks;IEEE Journal of Biomedical and Health Informatics;2024-07

4. PyMulSim: a method for computing node similarities between multilayer networks via graph isomorphism networks;BMC Bioinformatics;2024-06-13

5. A Survey on Graph Representation Learning Methods;ACM Transactions on Intelligent Systems and Technology;2024-01-16