User Cold-start Problem in Multi-armed Bandits: When the First Recommendations Guide the User’s Experience

Author:

Silva Nicollas1ORCID,Silva Thiago2ORCID,Werneck Heitor2ORCID,Rocha Leonardo2ORCID,Pereira Adriano1ORCID

Affiliation:

1. Universidade Federal de Minas Gerais, Belo Horizonte, Brazil

2. Universidade Federal de São João del-Rei, São João del-Rei, Brazil

Abstract

Nowadays, Recommender Systems have played a crucial role in several entertainment scenarios by making personalised recommendations and guiding the entire users’ journey from their first interaction. Recent works have addressed it as a Contextual Bandit by providing a sequential decision model to explore items not tried yet (or not tried enough) or exploit the best options learned so far. However, this work noticed these current algorithms are limited to naive non-personalised approaches in the first interactions of a new user, offering random or most popular items. Through experiments in three domains, we identify a negative impact of these first choices. Our study indicates that the bandit performance is directly related to the choices made in the first trials. Then, we propose a new approach to balance exploration and exploitation in the first interactions and handle these drawbacks. This approach is based on the Active Learning theory to catch more information about the new users and improve their long-term experience. Our idea is to explore the potential information gain of items that can also please the user’s taste. This method is named Warm-Starting Contextual Bandits, and it statistically outperforms 10 benchmarks in the literature in the long run.

Funder

CNPq

CAPES

Fapemig

AWS

INWEB

Publisher

Association for Computing Machinery (ACM)

Reference50 articles.

1. Rabaa Alabdulrahman, Herna Viktor, and Eric Paquet. 2019. Active learning and deep learning for the cold-start problem in recommendation system: A comparative study. In International Joint Conference on Knowledge Discovery, Knowledge Engineering, and Knowledge Management. Springer, 24–53.

2. Introduction to Bandits in Recommender Systems

3. Recommender systems survey;Bobadilla Jesús;Knowl.-Bas. Syst.,2013

4. Djallel Bouneffouf, Romain Laroche, Tanguy Urvoy, Raphael Féraud, and Robin Allesiardo. 2014. Contextual bandit for active learning: Active thompson sampling. In International Conference on Neural Information Processing. Springer, 405–412.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. New Community Cold-Start Recommendation: A Novel Large Language Model-based Method;SSRN Electronic Journal;2024

2. Tackling cold-start with deep personalized transfer of user preferences for cross-domain recommendation;International Journal of Data Science and Analytics;2023-11-03

3. A Complete Framework for Offline and Counterfactual Evaluations of Interactive Recommendation Systems;Proceedings of the 29th Brazilian Symposium on Multimedia and the Web;2023-10-23

4. Transparently Serving the Public: Enhancing Public Service Media Values through Exploration;Proceedings of the 17th ACM Conference on Recommender Systems;2023-09-14

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3