Temporal graph learning for dynamic link prediction with text in online social networks
-
Published:2023-11-29
Issue:
Volume:
Page:
-
ISSN:0885-6125
-
Container-title:Machine Learning
-
language:en
-
Short-container-title:Mach Learn
Author:
Dileo ManuelORCID, Zignani MatteoORCID, Gaito SabrinaORCID
Abstract
AbstractLink prediction in Online Social Networks—OSNs—has been the focus of numerous studies in the machine learning community. A successful machine learning-based solution for this task needs to (i) leverage global and local properties of the graph structure surrounding links; (ii) leverage the content produced by OSN users; and (iii) allow their representations to change over time, as thousands of new links between users and new content like textual posts, comments, images and videos are created/uploaded every month. Current works have successfully leveraged the structural information but only a few have also taken into account the textual content and/or the dynamicity of network structure and node attributes. In this paper, we propose a methodology based on temporal graph neural networks to handle the challenges described above. To understand the impact of textual content on this task, we provide a novel pipeline to include textual information alongside the structural one with the usage of BERT language models, dense preprocessing layers, and an effective post-processing decoder. We conducted the evaluation on a novel dataset gathered from an emerging blockchain-based online social network, using a live-update setting that takes into account the evolving nature of data and models. The dataset serves as a useful testing ground for link prediction evaluation because it provides high-resolution temporal information on link creation and textual content, characteristics hard to find in current benchmark datasets. Our results show that temporal graph learning is a promising solution for dynamic link prediction with text. Indeed, combining textual features and dynamic Graph Neural Networks—GNNs—leads to the best performances over time. On average, the textual content can enhance the performance of a dynamic GNN by 3.1% and, as the collection of documents increases in size over time, help even models that do not consider the structural information of the network.
Funder
Università degli Studi di Milano
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Software
Reference39 articles.
1. Ba, C. T., Michienzi, A., Guidi, B., Zignani, M., Ricci, L., & Gaito, S. (2022a). Fork-based user migration in blockchain online social media. In 14th ACM web science conference 2022, (pp. 174–184). 2. Ba, C. T., Zignani, M., & Gaito, S. (2022b). The role of cryptocurrency in the dynamics of blockchain-based social networks: The case of steemit. PloS one, 17(6), e0267612. 3. Barracchia, E., Pio, G., Bifet, A., Gomes, H. M., Pfahringer, B., & Ceci, M. (2022). Lp-robin: Link prediction in dynamic networks exploiting incremental node embedding. Information Sciences 606. https://doi.org/10.1016/j.ins.2022.05.079 4. Bruss, C. B., Khazane, A., Rider, J., Serpe, R. T., Gogoglou, A., & Hines, K. E. (2019). Deeptrax: Embedding graphs of financial transactions. In 2019 18th IEEE international conference on machine learning and applications (ICMLA) (pp. 126–133). 5. Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS 2014 workshop on deep learning, 2014.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|