1. Achiam et al. (2017). Constrained policy optimization.
2. Ainsworth et al. (2021). Faster policy learning with continuous-time gradients.
3. Amos, B., Jimenez, I., Sacks, J., Boots, B., and Kolter, J.Z. (2018). Differentiable MPC for End-to-end Planning and Control. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (eds.), Advances in Neural Information Processing Systems, volume 31, 8289–8300. Curran Associates, Inc.
4. Baydin et al. (2017). Automatic differentiation in machine learning: a survey. The Journal of Machine Learning Research.
5. Biegler (2010). Nonlinear Programming: Concepts, Algorithms, and Applications to Chemical Processes. Society for Industrial and Applied Mathematics.