A bad arm existence checking problem: How to utilize asymmetric problem structure?-Reference-Cited by-同舟云学术

A bad arm existence checking problem: How to utilize asymmetric problem structure?

Published:2019-10-30 Issue:2 Volume:109 Page:327-372
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Tabata Koji,Nakamura Atsuyoshi^ORCID,Honda Junya,Komatsuzaki Tamiki

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-019-05854-7.pdf

Reference18 articles.

1. Audibert, J., Bubeck, S., & Munos, R. (2010). Best arm identification in multi-armed bandits. In Proceedings of the 23rd conference on learning theory (pp. 41–53).

2. Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2–3), 235–256.

3. Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (2003). The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1), 48–77.

4. Bubeck, S., Munos, R., & Stoltz, G. (2011). Pure exploration in finitely-armed and continuous-armed bandits. Theoretical Computer Science, 412(19), 1832–1852.

5. Bubeck, S., Wang, T., & Viswanathan, N. (2013). Multiple identifications in multi-armed bandits. In Proceedings of the 30th international conference on machine learning, proceedings of machine learning research, vol 28 (pp. 258–265).

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-armed bandit algorithm for sequential experiments of molecular properties with dynamic feature selection;The Journal of Chemical Physics;2024-07-03

2. Gaussian process classification bandits;Pattern Recognition;2024-05

3. On-the-fly Raman microscopy guaranteeing the accuracy of discrimination;Proceedings of the National Academy of Sciences;2024-03-14

4. On-the-fly Raman microscopy guaranteeing the accuracy of diagnosis by reinforcement learning;High-Speed Biomedical Imaging and Spectroscopy VIII;2023-03-16

5. Classification Bandits: Classification Using Expected Rewards as Imperfect Discriminators;Lecture Notes in Computer Science;2021