GE2: A General and Efficient Knowledge Graph Embedding Learning System

Authors:

Chenguang Zheng (1), Guanxian Jiang (1), Xiao Yan (2), Peiqi Yin (1), Qihui Zhou (1), James Cheng (1)

Affiliation:

1. The Chinese University of Hong Kong, Hong Kong, Hong Kong

2. Centre for Perceptual and Interactive Intelligence (CPII), Hong Kong, Hong Kong

Abstract

Graph embedding learning computes an embedding vector for each node in a graph and finds many applications in areas such as social networks, e-commerce, and medicine. We observe that existing graph embedding systems (e.g., PBG, DGL-KE, and Marius) suffer from long CPU time and high CPU-GPU communication overhead, especially when using multiple GPUs. Moreover, implementing negative sampling algorithms on them is cumbersome, even though these algorithms have many variants and are crucial for model quality. We propose a new system called GE2, which achieves both generality and efficiency for graph embedding learning. In particular, we propose a general execution model that encompasses various negative sampling algorithms. Based on this execution model, we design a user-friendly API that allows users to easily express negative sampling algorithms. To support efficient training, we offload operations from CPU to GPU to exploit high parallelism and reduce CPU time. We also design COVER, which, to our knowledge, is the first algorithm to manage data swap between CPU and multiple GPUs with small communication costs. Extensive experimental results show that, compared with state-of-the-art graph embedding systems, GE2 trains consistently faster across different models and datasets, with speedups usually over 2x and up to 7.5x.
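
The abstract describes a pluggable API for expressing negative sampling algorithms, but the actual interface is not shown here. The following is a minimal, hypothetical Python/PyTorch sketch of what a swappable uniform negative sampler paired with a TransE-style scorer could look like; all function names and signatures are illustrative assumptions, not GE2's real API.

```python
# Hypothetical sketch (not GE2's actual interface): a negative sampler
# expressed as a standalone, swappable function, in the spirit of the
# abstract's claim that sampling algorithms should be easy to plug in.
import torch

def uniform_negative_sampler(num_nodes: int, batch_size: int,
                             num_negatives: int) -> torch.Tensor:
    """Sample negative tail entities uniformly at random."""
    return torch.randint(0, num_nodes, (batch_size, num_negatives))

def transe_score(head_emb, rel_emb, tail_emb):
    """TransE-style score: -||h + r - t||_1 (smaller distance = higher score)."""
    return -(head_emb + rel_emb - tail_emb).norm(p=1, dim=-1)

# Toy usage: 1000 entities, 10 relations, 16-dim embeddings, one mini-batch.
num_nodes, num_rels, dim, batch, k = 1000, 10, 16, 32, 8
entity_emb = torch.nn.Embedding(num_nodes, dim)
relation_emb = torch.nn.Embedding(num_rels, dim)

heads = torch.randint(0, num_nodes, (batch,))
rels = torch.randint(0, num_rels, (batch,))
neg_tails = uniform_negative_sampler(num_nodes, batch, k)     # (batch, k)

neg_scores = transe_score(entity_emb(heads).unsqueeze(1),     # (batch, 1, dim)
                          relation_emb(rels).unsqueeze(1),    # (batch, 1, dim)
                          entity_emb(neg_tails))              # (batch, k, dim)
print(neg_scores.shape)  # torch.Size([32, 8])
```

Swapping in a different sampling strategy (e.g., degree-based or self-adversarial sampling) would, under this sketch, only require replacing uniform_negative_sampler while the training loop stays unchanged.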

Funder

The Research Matching Grant Scheme (RMGS) of Hong Kong

The University Grants Committee of Hong Kong

Publisher

Association for Computing Machinery (ACM)

