User Cold-start Problem in Multi-armed Bandits: When the First Recommendations Guide the User’s Experience

Published: 2023-01-27 Issue: 1 Volume: 1 Page: 1-24
ISSN: 2770-6699
Container-title: ACM Transactions on Recommender Systems
Language: en
Short-container-title: ACM Trans. Recomm. Syst.

Author:

Silva Nicollas¹, Silva Thiago², Werneck Heitor², Rocha Leonardo², Pereira Adriano¹

Affiliation:

1. Universidade Federal de Minas Gerais, Belo Horizonte, Brazil

2. Universidade Federal de São João del-Rei, São João del-Rei, Brazil

Abstract

Nowadays, Recommender Systems have played a crucial role in several entertainment scenarios by making personalised recommendations and guiding the entire users’ journey from their first interaction. Recent works have addressed it as a Contextual Bandit by providing a sequential decision model to explore items not tried yet (or not tried enough) or exploit the best options learned so far. However, this work noticed these current algorithms are limited to naive non-personalised approaches in the first interactions of a new user, offering random or most popular items. Through experiments in three domains, we identify a negative impact of these first choices. Our study indicates that the bandit performance is directly related to the choices made in the first trials. Then, we propose a new approach to balance exploration and exploitation in the first interactions and handle these drawbacks. This approach is based on the Active Learning theory to catch more information about the new users and improve their long-term experience. Our idea is to explore the potential information gain of items that can also please the user’s taste. This method is named Warm-Starting Contextual Bandits, and it statistically outperforms 10 benchmarks in the literature in the long run.

Funder

CNPq

CAPES

Fapemig

AWS

INWEB

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3554819

Reference: 50 articles.

1. Rabaa Alabdulrahman, Herna Viktor, and Eric Paquet. 2019. Active learning and deep learning for the cold-start problem in recommendation system: A comparative study. In International Joint Conference on Knowledge Discovery, Knowledge Engineering, and Knowledge Management. Springer, 24–53.

2. Introduction to Bandits in Recommender Systems

3. Recommender systems survey;Bobadilla Jesús;Knowl.-Bas. Syst.,2013

4. Djallel Bouneffouf, Romain Laroche, Tanguy Urvoy, Raphael Féraud, and Robin Allesiardo. 2014. Contextual bandit for active learning: Active thompson sampling. In International Conference on Neural Information Processing. Springer, 405–412.

Cited by 4 articles.

1. New Community Cold-Start Recommendation: A Novel Large Language Model-based Method;SSRN Electronic Journal;2024

2. Tackling cold-start with deep personalized transfer of user preferences for cross-domain recommendation;International Journal of Data Science and Analytics;2023-11-03

3. A Complete Framework for Offline and Counterfactual Evaluations of Interactive Recommendation Systems;Proceedings of the 29th Brazilian Symposium on Multimedia and the Web;2023-10-23

4. Transparently Serving the Public: Enhancing Public Service Media Values through Exploration;Proceedings of the 17th ACM Conference on Recommender Systems;2023-09-14