Interactive Thompson Sampling for Multi-objective Multi-armed Bandits

Published: 2017
Pages: 18-34
ISSN: 0302-9743
Container title: Algorithmic Decision Theory

Author:

Roijers Diederik M., Zintgraf Luisa M., Nowé Ann

Publisher

Springer International Publishing

Link

http://link.springer.com/content/pdf/10.1007/978-3-319-67504-6_2

References (23 articles)

1. Agrawal, S., Goyal, N.: Analysis of Thompson sampling for the multi-armed bandit problem. In: COLT, pp. 39.1–39.26 (2012)

2. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47(2–3), 235–256 (2002)

3. Auer, P., Chiang, C.-K., Ortner, R., Drugan, M.M.: Pareto front identification from stochastic bandit feedback. In: AISTATS, pp. 939–947 (2016)

4. Benabbou, N., Perny, P.: Combining preference elicitation and search in multiobjective state-space graphs. In: IJCAI, pp. 297–303 (2015)

5. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)

Cited by 11 articles.

1. Handling Varied Objectives by Online Decision Making;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

2. Bandit algorithms: A comprehensive review and their dynamic selection from a portfolio for multicriteria top-k recommendation;Expert Systems with Applications;2024-07

3. Interactive Multi-Objective Reinforcement Learning for Continuous Robot Control;2023 20th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP);2023-12-15

4. Block-Level Surrogate Models for Inference Time Estimation in Hardware-Aware Neural Architecture Search;Machine Learning and Knowledge Discovery in Databases;2023

5. Expected scalarised returns dominance: a new solution concept for multi-objective decision making;Neural Computing and Applications;2022-07-05
