1. Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence);C Abrate,2021
2. Achiam, J., Held, D., Tamar, A., Abbeel, P.: Constrained policy optimization. In: International Conference on Machine Learning, pp. 22–31. PMLR (2017)
3. Altman, E.: Constrained Markov decision processes: stochastic modeling. Routledge (1999)
4. Ammar, H.B., Tutunov, R., Eaton, E.: Safe policy search for lifelong reinforcement learning with sublinear regret. In: International Conference on Machine Learning, pp. 2361–2369. PMLR (2015)
5. Bhatnagar, S., Lakshmanan, K.: An online actor-critic algorithm with function approximation for constrained Markov decision processes. J. Optim. Theory Appl. 153(3), 688–708 (2012)