Affiliation:
1. National Key Laboratory of Information Systems Engineering, National University of Defense Technology, Changsha 410003, China
Abstract
Research on recommendation methods using multimodal graph information presents a significant challenge within the realm of information services. Prior studies in this area have lacked precision in the purification and denoising of multimodal information and have insufficiently explored fusion methods. We introduce a multimodal graph recommendation approach leveraging cross-attention fusion. This model enhances and purifies multimodal information by embedding the IDs of items and their corresponding interactive users, thereby optimizing the utilization of such information. To facilitate better integration, we propose a cross-attention mechanism-based multimodal information fusion method, which effectively processes and merges related and differential information across modalities. Experimental results on three public datasets indicated that our model performed exceptionally well, demonstrating its efficacy in leveraging multimodal information.
Funder
National Defense Basic Scientific Research Program
Reference42 articles.
1. Cinar, Y.G., and Renders, J. (2020, January 25). Adaptive Pointwise-Pairwise Learning-to-Rank for Content-based Personalized Recommendation. Proceedings of the RecSys, Rio de Janeiro, Brazil.
2. Learning the User’s Deeper Preferences for Multi-modal Recommendation Systems;Lei;ACM Trans. Multim. Comput. Commun. Appl.,2023
3. Improving Image Representations via MoCo Pre-training for Multimodal CXR Classification;Serra;Lecture Notes in Computer Science, Proceedings of the Medical Image Understanding and Analysis, Cambridge, UK, 27–29 July 2022,2022
4. Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems;Yi;IEEE Trans. Multim.,2022
5. Chen, X., Chen, H., Xu, H., Zhang, Y., Cao, Y., Qin, Z., and Zha, H. (2019, January 21–25). Personalized Fashion Recommendation with Visual Explanations based on Multimodal Attention Network: Towards Visually Explainable Recommendation. Proceedings of the SIGIR, Paris, France.