1. Achiam, J., Held, D., Tamar, A., and Abbeel, P. (2017). Constrained policy optimization. In Proceedings of the 34th International Conference on Machine Learning, Volume 70, 22–31. JMLR.org.
2. Andersson, O., Heintz, F., and Doherty, P. (2015). Model-based reinforcement learning in continuous environments using real-time constrained optimization. In Twenty-Ninth AAAI Conference on Artificial Intelligence.
3. Badgwell (2018). Reinforcement learning – overview of recent progress and implications for process control.
4. Bertsekas, D.P. (2007). Dynamic Programming and Optimal Control, Vol. II. Athena Scientific, 3rd edition.
5. Borkar (2002). Q-learning for risk-sensitive control. Mathematics of Operations Research.