GRAPE for fast and scalable graph processing and random-walk-based embedding-Reference-Cited by-同舟云学术

GRAPE for fast and scalable graph processing and random-walk-based embedding

Published:2023-06-26 Issue:6 Volume:3 Page:552-568
ISSN:2662-8457
Container-title:Nature Computational Science
language:en
Short-container-title:Nat Comput Sci

Author:

Cappelletti Luca^ORCID,Fontana Tommaso^ORCID,Casiraghi Elena,Ravanmehr Vida,Callahan Tiffany J.^ORCID,Cano Carlos,Joachimiak Marcin P.,Mungall Christopher J.,Robinson Peter N.^ORCID,Reese Justin,Valentini Giorgio^ORCID

Abstract

AbstractGraph representation learning methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE (Graph Representation Learning, Prediction and Evaluation), a software resource for graph processing and embedding that is able to scale with big graphs by using specialized and smart data structures, algorithms, and a fast parallel implementation of random-walk-based methods. Compared with state-of-the-art software resources, GRAPE shows an improvement of orders of magnitude in empirical space and time complexity, as well as competitive edge- and node-label prediction performance. GRAPE comprises approximately 1.7 million well-documented lines of Python and Rust code and provides 69 node-embedding methods, 25 inference models, a collection of efficient graph-processing utilities, and over 80,000 graphs from the literature and other sources. Standardized interfaces allow a seamless integration of third-party libraries, while ready-to-use and modular pipelines permit an easy-to-use evaluation of graph-representation-learning methods, therefore also positioning GRAPE as a software resource that performs a fair comparison between methods and libraries for graph processing and embedding.

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Computer Science Applications,Computer Science (miscellaneous)

Link

https://www.nature.com/articles/s43588-023-00465-8.pdf

Reference54 articles.

1. Hamilton, W. L. Graph representation learning. Synth. Lect. Artif. Intell. Mach. Learn. 14, 1–159 (2020).

2. Shervashidze, N., Schweitzer, P., Van Leeuwen, E., Mehlhorn, K. & Borgwardt, K. M. Weisfeiler-Lehman graph kernels. J. Mach. Learn. Res. 12, 2539–2561 (2011).

3. Wu, Z., et al. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems. 32, 4–24 (2020).

4. Csardi, G. & Nepusz, T. The Igraph software package for complex network research. Inter. J. Complex Sys. 1695, 1–9 (2006)

5. Low, Y., Gonzalez, J., Kyrola, A., Bickson, D., Guestrin, C. and Hellerstein, J.M., Graphlab: a new framework for parallel machine learning. In Proc. 26th Conference on Uncertainty in Artificial Intelligence, UAI’10 340–349 (AUAI Press, 2010).

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An ontology-based knowledge graph for representing interactions involving RNA molecules;Scientific Data;2024-08-22

2. Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests;International Journal of Medical Informatics;2024-07

3. Measuring Patient Similarities in Clinical Data Repository through Graph Representation;2024 21st International Joint Conference on Computer Science and Software Engineering (JCSSE);2024-06-19

4. Construction and Enhancement of an RNA-Based Knowledge Graph for Discovering New RNA Drugs;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

5. An open source knowledge graph ecosystem for the life sciences;Scientific Data;2024-04-11