1. Arslan, G., Yuksel, S.: Decentralized Q-learning for stochastic teams and games. IEEE Trans. Autom. Control 62(4), 1545–1558 (2017)
2. Hernandez-Lerma, O.: Adaptive Markov Control Processes. Springer, New York (1989)
3. Wallis, W.A.: The statistical research group, 1942–1945. J. Am. Stat. Assoc. 75(370), 320–330 (1980)
4. Department of the Navy: Science and Technology Strategy for Intelligent Autonomous Systems (2021)
5. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn. MIT Press, Cambridge (2020)