Meta-CRS: A Dynamic Meta-Learning Approach for Effective Conversational Recommender System

Author:

Ni Yuxin1ORCID,Xia Yunwen2ORCID,Fang Hui3ORCID,Long Chong4ORCID,Kong Xinyu4ORCID,Li Daqian4ORCID,Yang Dong4ORCID,Zhang Jie2ORCID

Affiliation:

1. Nanyang Technological University, Singapore

2. School of Computer Science and Engineering, Nanyang Technological University, Singapore

3. RIIS, China

4. Ant Group, China

Abstract

Conversational recommender system (CRS) enhances the recommender system by acquiring the latest user preference through dialogues, where an agent needs to decide “whether to ask or recommend”, “which attributes to ask”, and “which items to recommend” in each round. To explore these questions, reinforcement learning is adopted in most CRS frameworks. However, existing studies somewhat ignore to consider the connection between the previous rounds and the current round of the conversation, which might lead to the lack of prior knowledge and inaccurate decisions. In this view, we propose to facilitate the connections between different rounds of conversations in a dialogue session through deep transformer-based multi-channel meta-reinforcement learning, so that the CRS agent can decide each action/decision based on previous states, actions, and their rewards. Besides, to better utilize a user’s historical preferences, we propose a more dynamic and personalized graph structure to support the conversation module and the recommendation module. Experiment results on five real-world datasets and an online evaluation with real users in an industrial environment validate the improvement of our method over the state-of-the-art approaches and the effectiveness of our designs.

Funder

Shanghai Rising-Star Program

Natural Science Foundation of Shanghai

National Natural Science Foundation of China

Program for Innovative Research Team of Shanghai University of Finance and Economics, Ant Group, and China Mobile Research Institute

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Reference62 articles.

1. Antreas Antoniou, Harri Edwards, and Amos Storkey. 2019. How to train your MAML. In Proceedings of the 7th International Conference on Learning Representations.

2. Jacob Beck Risto Vuorio Evan Zheran Liu Zheng Xiong Luisa Zintgraf Chelsea Finn and Shimon Whiteson. 2023. A survey of meta-reinforcement learning. arXiv:2301.08028. Retrieved from https://arxiv.org/abs/2301.08028.

3. Antoine Bordes Nicolas Usunier Alberto Garcia-Durán Jason Weston and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2 (NIPS’13) . Curran Associates Inc. Red Hook NY 2787–2795.

4. Qibin Chen Junyang Lin Yichang Zhang Ming Ding Yukuo Cen Hongxia Yang and Jie Tang. 2019. Towards knowledge-based recommender dialog system. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP’19) 1803–1813.

5. Q&R

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3