My Computer Is an Honor Student — but How Intelligent Is It? Standardized Tests as a Measure of AI-Reference-Cited by-同舟云学术

My Computer Is an Honor Student — but How Intelligent Is It? Standardized Tests as a Measure of AI

Published:2016-04-13 Issue:1 Volume:37 Page:5-12
ISSN:2371-9621
Container-title:AI Magazine
language:
Short-container-title:AIMag

Author:

Clark Peter,Etzioni Oren

Abstract

Given the well-known limitations of the Turing Test, there is a need for objective tests to both focus attention on, and measure progress towards, the goals of AI. In this paper we argue that machine performance on standardized tests should be a key component of any new measure of AI, because attaining a high level of performance requires solving significant AI problems involving language understanding and world modeling - critical skills for any machine that lays claim to intelligence. In addition, standardized tests have all the basic requirements of a practical test: they are accessible, easily comprehensible, clearly measurable, and offer a graduated progression from simple tasks to those requiring deep understanding of the world. Here we propose this task as a challenge problem for the community, summarize our state-of-the-art results on math and science tests, and provide supporting datasets

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

Artificial Intelligence

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Synthetic Data Generator for Solving Korean Arithmetic Word Problem;Mathematics;2022-09-27

2. ScienceQA: a novel resource for question answering on scholarly articles;International Journal on Digital Libraries;2022-07-20

3. A survey on providing customer and public administration based services using AI: chatbot;Multimedia Tools and Applications;2022-01-03

4. Is My Model Using the Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning;Transactions of the Association for Computational Linguistics;2022

5. Asking the right questions to solve algebraic word problems;Turkish Journal of Electrical Engineering and Computer Sciences;2022-01-01