Affiliation:
1. The Chinese University of Hong Kong, Hong Kong, Hong Kong
2. Centre for Perceptual and Interactive Intelligence (CPII), Hong Kong, Hong Kong
Abstract
Graph embedding learning computes an embedding vector for each node in a graph and finds many applications in areas such as social networks, e-commerce, and medicine. We observe that existing graph embedding systems (e.g., PBG, DGL-KE, and Marius) have long CPU time and high CPU-GPU communication overhead, especially when using multiple GPUs. Moreover, it is cumbersome to implement negative sampling algorithms on them, which have many variants and are crucial for model quality. We propose a new system called GE
2
, which achieves both <u>g</u>enerality and <u>e</u>fficiency for <u>g</u>raph <u>e</u>mbedding learning. In particular, we propose a general execution model that encompasses various negative sampling algorithms. Based on the execution model, we design a user-friendly API that allows users to easily express negative sampling algorithms. To support efficient training, we offload operations from CPU to GPU to enjoy high parallelism and reduce CPU time. We also design COVER, which, to our knowledge, is the first algorithm to manage data swap between CPU and multiple GPUs for small communication costs. Extensive experimental results show that, comparing with the state-of-the-art graph embedding systems, GE
2
trains consistently faster across different models and datasets, where the speedup is usually over 2x and can be up to 7.5x.
Funder
The Research Matching Grant Scheme (RMGS) of Hong Kong
The University Grants Committee of Hong Kong
Publisher
Association for Computing Machinery (ACM)
Reference69 articles.
1. Antoine Bordes, Nicolas Usunier, Alberto Garc'i a-Durá n, Jason Weston, and Oksana Yakhnenko. 2013. Translating Embeddings for Modeling Multi-relational Data. In Annual Conference on Neural Information Processing Systems 2013. December 5--8, 2013, Lake Tahoe, Nevada, United States. 2787--2795. https://proceedings.neurips.cc/paper/2013/hash/1cecc7a77928ca8133fa24680a88d2f9-Abstract.html
2. Knowledge Graph-based Event Embedding Framework for Financial Quantitative Investments
3. Jingtao Ding, Yuhan Quan, Quanming Yao, Yong Li, and Depeng Jin. 2020. Simplify and Robustify Negative Sampling for Implicit Collaborative Filtering. In Annual Conference on Neural Information Processing Systems 2020, December 6--12, 2020, virtual. https://proceedings.neurips.cc/paper/2020/hash/0c7119e3a6a2209da6a5b90e5b5b75bd-Abstract.html
4. Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning