1. Auer, P.: Using Confidence Bounds for Exploitation-Exploration Trade-Offs. Journal of Machine Learning Research 3, 397–422 (2002)
2. Cover, T.M., Thomas, J.A.: Elements of Information Theory. John Wiley & Sons (2006)
3. Deisenroth, M.P., Fox, D., Rasmussen, C.E.: Gaussian Processes for Data-Efficient Learning in Robotics and Control. Transactions on Pattern Analysis and Machine Intelligence 37, 408–423 (2015)
4. Fedorov, V.V.: Theory of Optimal Experiments. Academic Press (1972)
5. Galichet, N., Sebag, M., Teytaud, O.: Exploration vs exploitation vs safety: risk-aware multi-armed bandits. In: Ong, C.S., Ho, T.B. (eds.) Proceedings of the 5th Asian Conference on Machine Learning, JMLR: W&CP, vol. 29, pp. 245–260 (2013)