1. Joshua Achiam et al. "Constrained policy optimization". In: International Conference on Machine Learning. PMLR. 2017, pp. 22--31.
2. Pulkit Agrawal. "The Task Specification Problem". In: Conference on Robot Learning. PMLR. 2022, pp. 1745--1751.
3. APRIL: Active Preference Learning-Based Reinforcement Learning
4. Preference-Based Policy Learning
5. Mohammed Alshiekh et al. "Safe reinforcement learning via shielding". In: Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 32. 1. 2018.