1. Amodei D, Olah C, Steinhardt J, Christiano P, Schulman J, Mané D (2016) Concrete problems in ai safety. arXiv:1606.06565
2. Shalev-Shwartz S, Shammah S, Shashua A (2016) Safe, multi-agent, reinforcement learning for autonomous driving. arXiv:1610.03295
3. Alqahtani M, Scott MJ, Hu M (2022) Dynamic energy scheduling and routing of a large fleet of electric vehicles using multi-agent reinforcement learning. Comput Ind Eng 169:108180
4. Altman E (1995) Constrained markov decision processes. PhD thesis, INRIA
5. Achiam J, Held D, Tamar A, Abbeel P (2017) Constrained policy optimization. In: International conference on machine learning, pp 22–31. PMLR