1. Arnold, T., Kasenberg, D., Scheutz, M.: Value alignment or misalignment - what will keep systems accountable? In: AAAI Workshop on AI, Ethics, and Society (2017)
2. Christiano, P., Leike, J., Brown, T.B., Martic, M., Legg, S., Amodei, D.: Deep reinforcement learning from human preferences (2023)
3. Fürnkranz, J., Hüllermeier, E., Cheng, W., Park, S.H.: Preference-based reinforcement learning: a formal framework and a policy iteration algorithm. Mach. Learn. 89, 123–156 (2012). https://doi.org/10.1007/s10994-012-5313-8
4. Government, S.: Strategic project for economic recovery and transformation of digitalization of the water cycle. Report 2022. Technical report, Ministry for the Ecological Transition and Demographic Challenge (2022)
5. Guo, T., Yuan, Y., Zhao, P.: Admission-based reinforcement-learning algorithm in sequential social dilemmas. Appl. Sci. 13(3) (2023). https://doi.org/10.3390/app13031807. www.mdpi.com/2076-3417/13/3/1807