Unsupervised framework for evaluating and explaining structural node embeddings of graphs-Reference-Cited by-同舟云学术

Unsupervised framework for evaluating and explaining structural node embeddings of graphs

Published:2024-02-21 Issue:2 Volume:12 Page:
ISSN:2051-1329
Container-title:Journal of Complex Networks
language:en
Short-container-title:

Author:

Dehghan Ashkan¹^ORCID,Siuta Kinga¹²,Skorupka Agata¹²,Betlen Andrei³,Miller David³,Kamiński Bogumił²,Prałat Paweł¹

Affiliation:

1. Department of Mathematics, Toronto Metropolitan University , 350 Victoria Street , Toronto, ON M5B 2K3, Canada

2. SGH Warsaw School of Economics , al. Niepodległości 162 , 02-554 Warszawa, Poland

3. Patagona Technologies , Pickering, ON, Canada

Abstract

Abstract An embedding is a mapping from a set of nodes of a network into a real vector space. Embeddings can have various aims like capturing the underlying graph topology and structure, node-to-node relationship, or other relevant information about the graph, its subgraphs or nodes themselves. A practical challenge with using embeddings is that there are many available variants to choose from. Selecting a small set of most promising embeddings from the long list of possible options for a given task is challenging and often requires domain expertise. Embeddings can be categorized into two main types: classical embeddings and structural embeddings. Classical embeddings focus on learning both local and global proximity of nodes, while structural embeddings learn information specifically about the local structure of nodes’ neighbourhood. For classical node embeddings, there exists a framework which helps data scientists to identify (in an unsupervised way) a few embeddings that are worth further investigation. Unfortunately, no such framework exists for structural embeddings. In this article, we propose a framework for unsupervised ranking of structural graph embeddings. The proposed framework, apart from assigning an aggregate quality score for a structural embedding, additionally gives a data scientist insights into properties of this embedding. It produces information which predefined node features the embedding learns, how well it learns them, and which dimensions in the embedded space represent the predefined node features. Using this information, the user gets a level of explainability to an otherwise complex black-box embedding algorithm.

Funder

Canadian Department of National Defense

NSERC

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/comnet/article-pdf/12/2/cnae003/56744816/cnae003.pdf

Reference32 articles.

1. Graph based anomaly detection and description: a survey;Akoglu;Data Min. Knowl. Discov,2015