Multimodal representation learning for tourism recommendation with two-tower architecture

Author:

Cui YuhangORCID,Liang ShengbinORCID,Zhang YuYing

Abstract

Personalized recommendation plays an important role in many online service fields. In the field of tourism recommendation, tourist attractions contain rich context and content information. These implicit features include not only text, but also images and videos. In order to make better use of these features, researchers usually introduce richer feature information or more efficient feature representation methods, but the unrestricted introduction of a large amount of feature information will undoubtedly reduce the performance of the recommendation system. We propose a novel heterogeneous multimodal representation learning method for tourism recommendation. The proposed model is based on two-tower architecture, in which the item tower handles multimodal latent features: Bidirectional Long Short-Term Memory (Bi-LSTM) is used to extract the text features of items, and an External Attention Transformer (EANet) is used to extract image features of items, and connect these feature vectors with item IDs to enrich the feature representation of items. In order to increase the expressiveness of the model, we introduce a deep fully connected stack layer to fuse multimodal feature vectors and capture the hidden relationship between them. The model is tested on the three different datasets, our model is better than the baseline models in NDCG and precision.

Funder

FDCT Funding Scheme for Postdoctoral Researchers of Higher Education Institutions

Publisher

Public Library of Science (PLoS)

Reference50 articles.

1. Collaborative filtering recommendation algorithm based on user correlation and evolutionary clustering;J. Chen;Complex & Intelligent Systems,2020

2. An Improved Dual-Channel Deep Q-Network Model for Tourism Recommendation;S. Liang;Big Data,2023

3. Sampling-bias-corrected neural modeling for large corpus item recommendations;X. Yi;Proceedings of the 13th ACM Conference on Recommender Systems,2019

4. Mixed negative sampling for learning two-tower neural networks in recommendations;J. Yang;Companion Proceedings of the Web Conference,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3