Multi-modal recommendation algorithm fusing visual and textual features-Reference-Cited by-同舟云学术

Multi-modal recommendation algorithm fusing visual and textual features

Published:2023-06-29 Issue:6 Volume:18 Page:e0287927
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Hu Xuefeng,Yu Wenting^ORCID,Wu Yun,Chen Yukang

Abstract

In recommender systems, the lack of interaction data between users and items tends to lead to the problem of data sparsity and cold starts. Recently, the interest modeling frameworks incorporating multi-modal features are widely used in recommendation algorithms. These algorithms use image features and text features to extend the available information, which alleviate the data sparsity problem effectively, but they also have some limitations. On the one hand, multi-modal features of user interaction sequences are not considered in the interest modeling process. On the other hand, the aggregation of multi-modal features often employs simple aggregators, such as sums and concatenation, which do not distinguish the importance of different feature interactions. In this paper, to tackle this, we propose the FVTF (Fusing Visual and Textual Features) algorithm. First, we design a user history visual preference extraction module based on the Query-Key-Value attention to model users’ historical interests by using of visual features. Second, we design a feature fusion and interaction module based on the multi-head bit-wise attention to adaptively mine important feature combinations and update the higher-order attention fusion representation of features. We conduct experiments on the Movielens-1M dataset, and the experiments show that FVTF achieved the best performance compared with the benchmark recommendation algorithms.

Funder

National Natural Science Foundation of China

Science and Technology Foundation of Guizhou Province

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference35 articles.

1. Wang X. A Survey of Online Advertising Click-Through Rate Prediction Models. In: 2020 IEEE International Conference on Information Technology, Big Dataand Artificial Intelligence (ICIBA). vol. 1. IEEE; 2020. p. 516–521.

2. A CTR prediction approach for text advertising based on the SAE-LR deep neural network;Z Jiang;Journal of Information Processing Systems,2017

3. Richardson M, Dominowska E, Ragno R. Predicting clicks: estimating the click-through rate for new ads. In: Proceedings of the 16th international conference on World Wide Web; 2007. p. 521–530.

4. Rendle S. Factorization machines. In: 2010 IEEE International conference on data mining. IEEE; 2010. p. 995–1000.

5. Wide & Deep Learning for Recommender Systems

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring the Landscape of Hybrid Recommendation Systems in E-Commerce: A Systematic Literature Review;IEEE Access;2024