On Admissible Behaviours for Goal-Oriented Decision-Making of Value-Aware Agents

Published: 2023 Pages: 415-424
ISSN: 0302-9743
Container-title: Multi-Agent Systems

Author:

Holgado-Sánchez Andrés, Arias Joaquín, Moreno-Rebato Mar, Ossowski Sascha

Publisher

Springer Nature Switzerland

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-43264-4_27

References: 17 articles.

1. Arnold, T., Kasenberg, D., Scheutz, M.: Value alignment or misalignment - what will keep systems accountable? In: AAAI Workshop on AI, Ethics, and Society (2017)

2. Christiano, P., Leike, J., Brown, T.B., Martic, M., Legg, S., Amodei, D.: Deep reinforcement learning from human preferences (2023)

3. Fürnkranz, J., Hüllermeier, E., Cheng, W., Park, S.H.: Preference-based reinforcement learning: a formal framework and a policy iteration algorithm. Mach. Learn. 89, 123–156 (2012). https://doi.org/10.1007/s10994-012-5313-8

4. Government, S.: Strategic project for economic recovery and transformation of digitalization of the water cycle. Report 2022. Technical report, Ministry for the Ecological Transition and Demographic Challenge (2022)

5. Guo, T., Yuan, Y., Zhao, P.: Admission-based reinforcement-learning algorithm in sequential social dilemmas. Appl. Sci. 13(3) (2023). https://doi.org/10.3390/app13031807. www.mdpi.com/2076-3417/13/3/1807

Cited by 1 article.

1. Algorithms for Learning Value-Aligned Policies Considering Admissibility Relaxation. Lecture Notes in Computer Science (2024)