Visual Turing test for computer vision systems-Reference-Cited by-同舟云学术

Visual Turing test for computer vision systems

Published:2015-03-09 Issue:12 Volume:112 Page:3618-3623
ISSN:0027-8424
Container-title:Proceedings of the National Academy of Sciences
language:en
Short-container-title:Proc Natl Acad Sci USA

Author:

Geman Donald,Geman Stuart,Hallonquist Neil,Younes Laurent

Abstract

Today, computer vision systems are tested by their accuracy in detecting and localizing instances of objects. As an alternative, and motivated by the ability of humans to provide far richer descriptions and even tell a story about an image, we construct a “visual Turing test”: an operator-assisted device that produces a stochastic sequence of binary questions from a given test image. The query engine proposes a question; the operator either provides the correct answer or rejects the question as ambiguous; the engine proposes the next question (“just-in-time truthing”). The test is then administered to the computer-vision system, one question at a time. After the system’s answer is recorded, the system is provided the correct answer and the next question. Parsing is trivial and deterministic; the system being tested requires no natural language processing. The query engine employs statistical constraints, learned from a training set, to produce questions with essentially unpredictable answers—the answer to a question, given the history of questions and their correct answers, is nearly equally likely to be positive or negative. In this sense, the test is only about vision. The system is designed to produce streams of questions that follow natural story lines, from the instantiation of a unique object, through an exploration of its properties, and on to its relationships with other uniquely instantiated objects.

Funder

DOD | Office of Naval Research

DOD | Defense Advanced Research Projects Agency

National Science Foundation

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference28 articles.

1. Computing machinery and intelligence;Turing;Mind,1950

2. Saygin AP Cicekli I Akman V (2003) Turing Test: 50 Years Later. The Turing Test, ed Moor JH (Springer, Heidelberg, Germany), pp 23–78

3. Russell SJ Norvig P (2003) Artificial Intelligence: A Modern Approach (Pearson Education, Harlow, UK)

4. The Pascal Visual Object Classes (VOC) Challenge

5. Deng J (2009) Imagenet: A large-scale hierarchical image database. Proceedings IEEE 2009 CVPR (IEEE, New York), pp 248–255

Cited by 161 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Generative adversarial network–based reconstruction of healthy anatomy for anomaly detection in brain CT scans;Journal of Medical Imaging;2024-08-09

2. EL PROBLEMA DE LA ÉTICA: EL CONVIVIO. Ética e Inteligencia Artificial;Revista Iberoamericana de Complejidad y Ciencias Económicas;2024-06-30

3. Multimodal attention-driven visual question answering for Malayalam;Neural Computing and Applications;2024-05-10

4. Detect2Interact: Localizing Object Key Field in Visual Question Answering with LLMs;IEEE Intelligent Systems;2024-05

5. Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images;Human Reproduction;2024-04-10