Teach and Explore: A Multiplex Information-guided Effective and Efficient Reinforcement Learning for Sequential Recommendation

Author:

Yan Surong1,Shi Chenglong1,Wang Haosen2,Chen Lei3,Jiang Ling1,Guo Ruilin1,Lin Kwei-Jay4

Affiliation:

1. Zhejiang University of Finance and Economics, China

2. Southeast University, China

3. Shanghai Jiao Tong University, China

4. University of California Irvine, USA and Chang Gung University, China

Abstract

Casting sequential recommendation (SR) as a reinforcement learning (RL) problem is promising and some RL-based methods have been proposed for SR. However, these models are sub-optimal due to the following limitations: a) they fail to leverage the supervision signals in the RL training to capture users’ explicit preferences, leading to slow convergence; and b) they do not utilize auxiliary information (e.g., knowledge graph) to avoid blindness when exploring users’ potential interests. To address the above-mentioned limitations, we propose a multiplex information-guided RL model (MELOD), which employs a novel RL training framework with Teach and Explore components for SR. We adopt a Teach component to accurately capture users’ explicit preferences and speed up RL convergence. Meanwhile, we design a dynamic intent induction network (DIIN) as a policy function to generate diverse predictions. We utilize the DIIN for the Explore component to mine users’ potential interests by conducting a sequential and knowledge information joint-guided exploration. Moreover, a sequential and knowledge-aware reward function is designed to achieve stable RL training. These components significantly improve MELOD’s performance and convergence against existing RL algorithms to achieve effectiveness and efficiency. Experimental results on seven real-world datasets show that our model significantly outperforms state-of-the-art methods.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Reference55 articles.

1. Recommender system application developments: A survey

2. Deep Learning Based Recommender System

3. Wayne Xin Zhao , Shanlei Mu , Yupeng Hou , Zihan Lin , Yushuo Chen , Xingyu Pan , Kaiyuan Li , Yujie Lu , Hui Wang , Changxin Tian , Yingqian Min , Zhichao Feng , Xinyan Fan , Xu Chen , Pengfei Wang , Wendi Ji , Yaliang Li , Xiaoling Wang , and Ji-Rong Wen . 2021 . RecBole: Towards a Unified, Comprehensive and Efficient Framework for Recommendation Algorithms. In CIKM ’21 : The 30th ACM International Conference on Information and Knowledge Management. 4653–4664 . Wayne Xin Zhao, Shanlei Mu, Yupeng Hou, Zihan Lin, Yushuo Chen, Xingyu Pan, Kaiyuan Li, Yujie Lu, Hui Wang, Changxin Tian, Yingqian Min, Zhichao Feng, Xinyan Fan, Xu Chen, Pengfei Wang, Wendi Ji, Yaliang Li, Xiaoling Wang, and Ji-Rong Wen. 2021. RecBole: Towards a Unified, Comprehensive and Efficient Framework for Recommendation Algorithms. In CIKM ’21: The 30th ACM International Conference on Information and Knowledge Management. 4653–4664.

4. Reinforcement Learning to Rank in E-Commerce Search Engine

5. Fajie Yuan , Alexandros Karatzoglou , Ioannis Arapakis , Joemon  M. Jose , and Xiangnan He. A Simple Convolutional Generative Network for Next Item Recommendation. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining. 582–590 . Fajie Yuan, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose, and Xiangnan He. A Simple Convolutional Generative Network for Next Item Recommendation. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining. 582–590.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3