1. Lattimore, T., Szepesvári, C.: Bandit Algorithms. Cambridge University Press, Cambridge (2020)
2. Sragovich, V.: Mathematical Theory of Adaptive Control. World Scientific, Singapore (2006)
3. Tsetlin, M.: Automaton Theory and Modeling of Biological Systems. Academic Press, New York (1973)
4. Auer, P.: Using confidence bounds for exploitation-exploration trade-offs. J. Mach. Learn. Res. 3, 397–422 (2002)
5. Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, New York (2006)