1. Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., & Dean, J., et al. (2016). Tensorflow: A system for large-scale machine learning. In 12th {USENIX} symposium on operating systems design and implementation (pp. 265–283).
2. Constrained policy optimization;Achiam,2017
3. Constrained markov decision processes (vol. 7);Altman,1999
4. Concrete problems in ai safety;Amodei,2016
5. Safe model-based reinforcement learning with stability guarantees;Berkenkamp,2017