In Conversation with Artificial Intelligence: Aligning language Models with Human Values-Reference-Cited by-同舟云学术

In Conversation with Artificial Intelligence: Aligning language Models with Human Values

Published:2023-04-19 Issue:2 Volume:36 Page:
ISSN:2210-5433
Container-title:Philosophy & Technology
language:en
Short-container-title:Philos. Technol.

Author:

Kasirzadeh Atoosa,Gabriel Iason

Abstract

AbstractLarge-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in response to prompts and queries. This mode of engagement raises a number of social and ethical questions. For example, what does it mean to align conversational agents with human norms or values? Which norms or values should they be aligned with? And how can this be accomplished? In this paper, we propose a number of steps that help answer these questions. We start by developing a philosophical analysis of the building blocks of linguistic communication between conversational agents and human interlocutors. We then use this analysis to identify and formulate ideal norms of conversation that can govern successful linguistic communication between humans and conversational agents. Furthermore, we explore how these norms can be used to align conversational agents with human values across a range of different discursive domains. We conclude by discussing the practical implications of our proposal for the design of conversational agents that are aligned with these norms and values.

Publisher

Springer Science and Business Media LLC

Subject

History and Philosophy of Science,Philosophy

Link

https://link.springer.com/content/pdf/10.1007/s13347-023-00606-x.pdf

Reference116 articles.

1. Abid, A., Farooqi, M., & Zou, J. (2021). Persistent anti-muslim bias in large language models. In Proceedings of the 2021 AAAI/ACM conference on AI ethics, and society (pp. 298–306).

2. Ackerly, B.A. (2000). Political theory and feminist social criticism. New York: Cambridge University Press.

3. Allan, K. (2013). What is common ground?. In Perspectives on linguistic pragmatics (pp. 285–310). Springer.

4. Anderson, E. (2004). Uses of value judgments in science: a general argument, with lessons from a case study of feminist research on divorce. Hypatia, 19 (1), 1–24.

5. Androutsopoulos, J. (2014). Languaging when contexts collapse: Audience design in social networking. Discourse, Context & Media, 4, 62–73.

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Defending ChatGPT against jailbreak attack via self-reminders;Nature Machine Intelligence;2023-12-12

2. Augmenting Intelligence With Generative AI;Practices That Promote Innovation for Talented Students;2023-11-17

3. Leveraging Generative AI and Large Language Models: A Comprehensive Roadmap for Healthcare Integration;Healthcare;2023-10-20

4. Typology of Risks of Generative Text-to-Image Models;Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society;2023-08-08

5. “Personhood and AI: Why large language models don’t understand us”;AI & SOCIETY;2023-07-12