Author:
Liu Kweiguu,Maghsudi Setareh,Yokoo Makoto
Publisher
Springer Nature Switzerland
Reference19 articles.
1. Amuru, S., Buehrer, R.M.: Optimal jamming using delayed learning. In: 2014 IEEE Military Communications Conference, IEEE (2014), pp. 1528–1533 (2014)
2. Badanidiyuru, A., Langford, J., Slivkins, A.: Resourceful contextual bandits. In: Conference on Learning Theory, PMLR (2014), pp. 1109–1134 (2014)
3. Bastani, H., et al.: Efficient and targeted Covid-19 border testing via reinforcement learning. Nature 599(7883), 108–113 (2021)
4. Bubeck, S., Cesa-Bianchi, N., Lugosi, G.: Bandits with heavy tail. IEEE Trans. Inf. Theory 59(11), 7711–7717 (2013)
5. Bubeck, S., Wang, T., Viswanathan, N.: Multiple identifications in multi-armed bandits. In: International Conference on Machine Learning, PMLR (2013), pp. 258–265 (2013)