Value Alignment for Advanced Artificial Judicial Intelligence-Reference-Cited by-同舟云学术

Value Alignment for Advanced Artificial Judicial Intelligence

Published:2023-04-01 Issue:2 Volume:60 Page:187-203
ISSN:0003-0481
Container-title:American Philosophical Quarterly
language:en
Short-container-title:

Author:

Winter Christoph¹,Hollman Nicholas²,Manheim David³

Affiliation:

1. Instituto Tecnológico Autónomo de México Mexico City, Mexico / Harvard University Cambridge, MA, USA Christoph_winter@fas.harvard.edu

2. Legal Priorities Project Cambridge, MA, USA

3. Israel Institute of Technology Haifa, Israel / Foresight Institute San Francisco, CA, USA

Abstract

AbstractThis paper considers challenges resulting from the use of advanced artificial judicial intelligence (AAJI). We argue that these challenges should be considered through the lens of value alignment. Instead of discussing why specific goals and values, such as fairness and nondiscrimination, ought to be implemented, we consider the question of how AAJI can be aligned with goals and values more generally, in order to be reliably integrated into legal and judicial systems. This value alignment framing draws on AI safety and alignment literature to introduce two otherwise neglected considerations for AAJI safety: specification and assurance. We outline diverse research directions and suggest the adoption of assurance and specification mechanisms as the use of AI in the judiciary progresses. While we focus on specification and assurance to illustrate the value of the AI safety and alignment literature, we encourage researchers in law and philosophy to consider what other lessons may be drawn.

Publisher

University of Illinois Press

Subject

Philosophy

Link

https://scholarlypublishingcollective.org/uip/apq/article-pdf/60/2/187/1817844/187winter.pdf

Reference137 articles.

1. The Path of the Law: Towards Legal Singularity,;Alarie;University of Toronto Law Journal,2016

2. Constitutional Law in the Age of Balancing,;Aleinikoff;Yale Law Journal,1987

3. Amodei, Dario , and JackClark. 2016. “Faulty Reward Functions in the Wild,” OpenAI, December21. http://openai.com/blog/faulty-reward-functions/.

4. Amodei, Dario , ChrisOlah, JacobSteinhardt, PaulChristiano, JohnSchulman, and DanMané. 2016. “Concrete Problems in AI Safety.” (unpublished manuscript, July 25). http://arxiv.org/abs/1606.06565.

5. “Artificial Intelligence Incident Database.” 2022. Partnership on AI. http://incidentdatabase.ai/.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Re-evaluating GPT-4’s bar exam performance;Artificial Intelligence and Law;2024-03-30

2. The Incorporation of Large Language Models (LLMs) in the Field of Education;Advances in Human and Social Aspects of Technology;2023-10-16

3. Re-Evaluating GPT-4's Bar Exam Performance;SSRN Electronic Journal;2023