Language agents reduce the risk of existential catastrophe

Published: 2023-08-19
ISSN: 0951-5666
Container-title: AI & SOCIETY
Language: en
Short-container-title: AI & Soc

Author:

Goldstein Simon, Kirk-Giannini Cameron Domenico

Funder

The Center for AI Safety

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Human-Computer Interaction, Philosophy

Link

https://link.springer.com/content/pdf/10.1007/s00146-023-01748-4.pdf

References: 30 articles.

1. Amodei D, Clark J (2016) Faulty reward functions in the wild. Blog Post. https://blog.openai.com/faulty-reward-functions/

2. Amodei D, Olah C, Steinhardt J, Christiano P, Schulman J, Mané D (2016) Concrete problems in AI safety. Manuscript. https://arxiv.org/abs/1606.06565

3. Bostrom N (2014) Superintelligence: paths, dangers, strategies. Oxford University Press

4. Bubeck S, Chandrasekaran V, Eldan R, Gehrke J, Horvitz E, Kamar E, Lee P, Lee YT, Li Y, Lundberg S, Nori H, Palangi H, Ribeiro MT, Zhang Y (2023) Sparks of artificial general intelligence: early experiments with GPT-4. Manuscript. https://arxiv.org/abs/2303.12712

5. Burns C, Ye H, Klein D, Steinhardt J (2022) Discovering latent knowledge in language models without supervision. Manuscript. https://arxiv.org/abs/2212.03827

Cited by 7 articles.

1. Is Alignment Unsafe?; Philosophy & Technology; 2024-08-27

2. Language Agents and Malevolent Design; Philosophy & Technology; 2024-08-17

3. AGI crimes? The role of criminal law in mitigating existential risks posed by artificial general intelligence; AI & SOCIETY; 2024-08-06

4. Risk and artificial general intelligence; AI & SOCIETY; 2024-07-09

5. Assessing the risk of takeover catastrophe from large language models; Risk Analysis; 2024-06-30