Exploring the Performance of Continuous-Time Dynamic Link Prediction Algorithms
-
Published:2024-04-22
Issue:8
Volume:14
Page:3516
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Romero Raphaël1ORCID, Buyl Maarten1ORCID, De Bie Tijl1ORCID, Lijffijt Jefrey1ORCID
Affiliation:
1. AIDA, IDLab-ELIS, Department of Engineering and Architecture, Ghent University, 9052 Ghent, Belgium
Abstract
Dynamic Link Prediction (DLP) addresses the prediction of future links in evolving networks. However, accurately portraying the performance of DLP algorithms poses challenges that might impede progress in the field. Importantly, common evaluation pipelines usually calculate ranking or binary classification metrics, where the scores of observed interactions (positives) are compared with those of randomly generated ones (negatives). However, a single metric is not sufficient to fully capture the differences between DLP algorithms, and is prone to overly optimistic performance evaluation. Instead, an in-depth evaluation should reflect performance variations across different nodes, edges, and time segments. In this work, we contribute tools to perform such a comprehensive evaluation. (1) We propose Birth–Death diagrams, a simple but powerful visualization technique that illustrates the effect of time-based train–test splitting on the difficulty of DLP on a given dataset. (2) We describe an exhaustive taxonomy of negative sampling methods that can be used at evaluation time. (3) We carry out an empirical study of the effect of the different negative sampling strategies. Our comparison between heuristics and state-of-the-art memory-based methods on various real-world datasets confirms a strong effect of using different negative sampling strategies on the test area under the curve (AUC). Moreover, we conduct a visual exploration of the prediction, with additional insights on which different types of errors are prominent over time.
Funder
Special Research Fund (BOF) of Ghent University Flemish Government under the “Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen” programme FWO
Reference45 articles.
1. Yoon, M., Hooi, B., Shin, K., and Faloutsos, C. (2019, January 4–8). Fast and Accurate Anomaly Detection in Dynamic Graphs with a Two-Pronged Approach. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA. 2. Machens, A., Gesualdo, F., Rizzo, C., Tozzi, A.E., Barrat, A., and Cattuto, C. (2013). An Infectious Disease Model on Empirical Networks of Human Contact: Bridging the Gap between Dynamic Network Data and Contact Matrices. BMC Infect. Dis., 13. 3. Cadena, J., Sales, A.P., Lam, D., Enright, H.A., Wheeler, E.K., and Fischer, N.O. (2020). Modeling the Temporal Network Dynamics of Neuronal Cultures. PLoS Comput. Biol., 16. 4. Boulicaut, J.F., Esposito, F., Giannotti, F., and Pedreschi, D. (2004). Machine Learning: ECML 2004, Springer. Lecture Notes in Computer Science. 5. Reality Mining: Sensing Complex Social Systems;Eagle;Pers. Ubiquitous Comput.,2006
|
|