Publisher
Springer Nature Switzerland
Reference10 articles.
1. Altman, E.: Constrained Markov Decision Processes: Stochastic Modeling. Routledge (1999)
2. Clempner, J.B.: A lyapunov approach for stable reinforcement learning. Comput. Appl. Math. 41, 279 (2022)
3. Clempner, J.B., Poznyak, A.S.: Analysis of best-reply strategies in repeated finite markov chains games. In: 52nd IEEE Conference on Decision and Control (CDC), pp. 568–573. Firenze, Italy (2013)
4. Clempner, J.B., Poznyak, A.S.: Simple computing of the customer lifetime value: a fixed local-optimal policy approach. J. Syst. Sci. Syst. Eng. 23(4), 439–459 (2014)
5. Howard, R.A.: Dynamic Programming and Markov Processes (1960)