1. Apprenticeship learning via inverse reinforcement learning;Abbeel,2004
2. Achiam, J., Held, D., Tamar, A., Abbeel, P., 2017. Constrained policy optimization. 1705.10528.
3. Constrained markov decision processes;Altman,1999
4. Dynamic programming and optimal control;Bertsekas,1995
5. Economic stochastic model predictive control using the unscented kalman filter;Bradford;IFAC-PapersOnLine,2018