1. Joshua Achiam, David Held, Aviv Tamar, and Pieter Abbeel. 2017. Constrained policy optimization. In International Conference on Machine Learning. PMLR, 22--31.
2. Eitan Altman. 1999. Constrained Markov decision processes: stochastic modeling. Routledge.
3. Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo Jovanovic. 2021. Provably efficient safe exploration via primal-dual policy optimization. In International Conference on Artificial Intelligence and Statistics. PMLR, 3304--3312.
4. Dongsheng Ding et al. 2020. Natural policy gradient primal-dual method for constrained Markov decision processes. In Advances in Neural Information Processing Systems.