Abstract
Purpose
The purpose of this paper is to explain to readers how intelligent systems can fail and how artificial intelligence (AI) safety is different from cybersecurity. The goal of cybersecurity is to reduce the number of successful attacks on the system; the goal of AI Safety is to make sure zero attacks succeed in bypassing the safety mechanisms. Unfortunately, such a level of performance is unachievable. Every security system will eventually fail; there is no such thing as a 100 per cent secure system.
Design/methodology/approach
AI Safety can be improved based on ideas developed by cybersecurity experts. For narrow AI Safety, failures are at the same, moderate level of criticality as in cybersecurity; however, for general AI, failures have a fundamentally different impact. A single failure of a superintelligent system may cause a catastrophic event without a chance for recovery.
Findings
In this paper, the authors present and analyze reported failures of artificially intelligent systems and extrapolate our analysis to future AIs. The authors suggest that both the frequency and the seriousness of future AI failures will steadily increase.
Originality/value
This is a first attempt to assemble a public data set of AI failures and is extremely valuable to AI Safety researchers.
Subject
Business and International Management,Management of Technology and Innovation
Reference67 articles.
1. AAAI 2006 spring symposium reports;AI Magazine,2006
2. Concrete problems in AI safety,2016
3. Security solutions for intelligent and complex systems,2016
4. Thinking inside the box: controlling and using an oracle ai;Minds and Machines,2012
5. Babcock, J., Kramar, J. and Yampolskiy, R. (2016a), “The AGI containment problem”, Paper presented at the The Ninth Conference on Artificial General Intelligence (AGI2015).
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献