Limited correspondence in visual representation between the human brain and convolutional neural networks-Reference-Cited by-同舟云学术

Limited correspondence in visual representation between the human brain and convolutional neural networks

Published:2020-03-14 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Xu Yaoda^ORCID,Vaziri-Pashkam Maryam

Abstract

ABSTRACTConvolutional neural networks (CNNs) have achieved very high object categorization performance recently. It has increasingly become a common practice in human fMRI research to regard CNNs as working model of the human visual system. Here we reevaluate this approach by comparing fMRI responses from the human brain in three experiments with those from 14 different CNNs. Our visual stimuli included original and filtered versions of real-world object images and images of artificial objects. Replicating previous findings, we found a brain-CNN correspondence in a number of CNNs with lower and higher levels of visual representations in the human brain better resembling those of lower and higher CNN layers, respectively. Moreover, the lower layers of some CNNs could fully capture the representational structure of human early visual areas for both the original and filtered real-world object images. Despite these successes, no CNN examined could fully capture the representational structure of higher human visual processing areas. They also failed to capture that of artificial object images in all levels of visual processing. The latter is particularly troublesome, as decades of vision research has demonstrated that the same algorithms used in the processing of natural images would support the processing of artificial visual stimuli in the primate brain. Similar results were obtained when a CNN was trained with stylized object images that emphasized shape representation. CNNs likely represent visual information in fundamentally different ways from the human brain. Current CNNs thus may not serve as sound working models of the human visual system.Significance StatementRecent CNNs have achieved very high object categorization performance, with some even exceeding human performance. It has become common practice in recent neuroscience research to regard CNNs as working models of the human visual system. Here we evaluate this approach by comparing fMRI responses from the human brain with those from 14 different CNNs. Despite CNNs’ ability to successfully perform visual object categorization like the human visual system, they appear to represent visual information in fundamentally different ways from the human brain. Current CNNs thus may not serve as sound working models of the human visual system. Given the current dominating trend of incorporating CNN modeling in visual neuroscience research, our results question the validity of such an approach.

Publisher

Cold Spring Harbor Laboratory

Reference62 articles.

1. Deep convolutional networks do not classify based on global object shape;PLOS Comput Biol,2018

2. Ballester, P , de Araújo RM (2016) On the Performance of GoogLeNet and AlexNet Applied to Sketches. In AAAI (pp. 1124–1128).

3. Bashivan P , Kar K , DiCarlo JJ (2019) Neural population control via deep image synthesis. Science 364:eaav9436.

4. Controlling the False Discovery Rate - a Practical and Powerful Approach to Multiple Testing;J Roy Stat Soc B Met,1995

5. Understanding location- and feature-based processing along the human intraparietal sulcus

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparing memory capacity across stimuli requires maximally dissimilar foils: Using deep convolutional neural networks to understand visual working memory capacity for real-world objects;Memory & Cognition;2023-11-16

2. Two distinct networks containing position-tolerant representations of actions in the human brain;2021-06-18

3. General object-based features account for letter perception better than specialized letter features;2021-04-22

4. Using deep neural networks to evaluate object vision tasks in rats;PLOS Computational Biology;2021-03-02

5. The relative coding strength of object identity and nonidentity features in human occipito-temporal cortex and convolutional neural networks;2020-08-12