1. Joshua Achiam, David Held, Aviv Tamar, and Pieter Abbeel. 2017. Constrained policy optimization. In International Conference on Machine Learning. PMLR, 22–31.
2. Relaxations of Weakly Coupled Stochastic Dynamic Programs
3. Rishabh Agarwal, Dale Schuurmans, and Mohammad Norouzi. 2020. An optimistic perspective on offline reinforcement learning. In International Conference on Machine Learning. PMLR, 104–114.
4. Eitan Altman. 1999. Constrained Markov decision processes. Routledge.
5. Kiam Heong Ang, Gregory Chong, and Yun Li. 2005. PID control system analysis, design, and technology. IEEE Transactions on Control Systems Technology 13, 4 (2005), 559–576.