1. Achiam, J., Held, D., Tamar, A., and Abbeel, P. (2017). Constrained policy optimization. In D. Precup and Y.W. Teh (eds.), Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, 22–31. PMLR, International Convention Centre, Sydney, Australia. URL http://proceedings.mlr.press/v70/achiam17a.html.
2. Dynamic programming;Bellman;Science,1966
3. Chow, Y., Nachum, O., Faust, A., Duenez-Guzman, E., and Ghavamzadeh, M. (2019). Lyapunov-based safe policy optimization for continuous control. arXiv preprint arXiv:1901.10031.
4. An efficient constraint handling method for genetic algorithms;Deb;Computer methods in applied mechanics and engineering,2000
5. Nonlinear model predictive control;Grüne,2017