Abstract
AbstractAs a task of high importance for recommender systems, we consider the problem of learning the convex combination of ranking algorithms by online machine learning. First, we propose a stochastic optimization algorithm that uses finite differences. Our new algorithm achieves close to optimal empirical performance for two base rankers, while scaling well with an increased number of models. In our experiments with five real-world recommendation data sets, we show that the combination offers significant improvement over previously known stochastic optimization techniques. The proposed algorithm is the first effective stochastic optimization method for combining ranked recommendation lists by online machine learning. Secondly, we propose an exponentially weighted algorithm based on a grid over the space of combination weights. We show that the algorithm has near-optimal worst-case performance bound. The bound provides the first theoretical guarantee for non-convex bandits using limited number of evaluations under very general conditions.
Funder
Hungarian Ministry of Innovation and Technology NRDI Office within the framework of the Hungarian Artificial Intelligence National Laboratory Program
ELKH Institute for Computer Science and Control
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications,Human-Computer Interaction,Education
Reference52 articles.
1. Abernethy, J., Canini, K., Langford, J., Simma, A.: Online Collaborative Filtering. University of California at Berkeley, Technical Report (2007)
2. Agarwal, A., Dekel, O., Xiao, L.: Optimal algorithms for online convex optimization with multi-point bandit feedback. In: COLT, pp. 28–40 (2010)
3. Al-Ghossein, M., Murena, P.A., Abdessalem, T., Barré, A., Cornuéjols, A.: Adaptive collaborative topic modeling for online recommendation. In: Proceedings of the 12th ACM Conference on Recommender Systems, pp. 338–346. ACM (2018)
4. Amatriain, X., Agarwal, D.: Tutorial: lessons learned from building real-life recommender systems. In: Proceedings of the 10th ACM Conference on Recommender Systems, p. 433 (2016)
5. Au, C.K., Leung, H.F.: An empirical comparison of CMA-ES in dynamic environments. In: International Conference on Parallel Problem Solving from Nature, pp. 529–538. Springer (2012)
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献