Rethinking Reinforcement Learning for Recommendation

Published: 2022-07-06
Container-title: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Author:

Xin Xin¹, Pimentel Tiago², Karatzoglou Alexandros³, Ren Pengjie¹, Christakopoulou Konstantina⁴, Ren Zhaochun¹

Affiliation:

1. Shandong University, Qingdao City, China

2. University of Cambridge, Cambridge, United Kingdom

3. Google Research, London, United Kingdom

4. Google, Mountain View, CA, USA

Funder

National Key R&D Program of China

Tencent WeChat Rhino-Bird Focused Research Program

Natural Science Foundation of Shandong Province

Fundamental Research Funds of Shandong University

Key Scientific and Technological Innovation Program of Shandong Province

Natural Science Foundation of China

Shandong University multidisciplinary research and innovation team of young scholars

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3477495.3531714

Reference: 47 articles.

1. M. Mehdi Afsar, Trafford Crump, and Behrouz Far. 2021. Reinforcement learning based recommender systems: A survey. arXiv preprint arXiv:2101.06286.

2. Sanjeev Arora, Rong Ge, Yingyu Liang, Tengyu Ma, and Yi Zhang. 2017. Generalization and equilibrium in generative adversarial nets (GANs). In International Conference on Machine Learning. PMLR, 224–232.

3. Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016. Layer normalization. arXiv preprint arXiv:1607.06450.

4. Richard Bellman. 1966. Dynamic programming. Science, Vol. 153, 3731, 34–37.

5. Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. 2020. Bias and debias in recommender system: A survey and future directions. arXiv preprint arXiv:2010.03240.

Cited by 18 articles.

1. Achieving EEG-based depression recognition using Decentralized-Centralized structure;Biomedical Signal Processing and Control;2024-09

2. Future Impact Decomposition in Request-level Recommendations;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

4. Rethinking Offline Reinforcement Learning for Sequential Recommendation from A Pair-Wise Q-Learning Perspective;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

5. Personalization for web-based services using offline reinforcement learning;Machine Learning;2024-03-28