Study on Representation Invariances of CNNs and Human Visual Information Processing Based on Data Augmentation-Reference-Cited by-同舟云学术

Study on Representation Invariances of CNNs and Human Visual Information Processing Based on Data Augmentation

Published:2020-09-02 Issue:9 Volume:10 Page:602
ISSN:2076-3425
Container-title:Brain Sciences
language:en
Short-container-title:Brain Sciences

Author:

Cui Yibo^ORCID,Zhang Chi,Qiao Kai,Wang Linyuan^ORCID,Yan Bin,Tong Li

Abstract

Representation invariance plays a significant role in the performance of deep convolutional neural networks (CNNs) and human visual information processing in various complicated image-based tasks. However, there has been abounding confusion concerning the representation invariance mechanisms of the two sophisticated systems. To investigate their relationship under common conditions, we proposed a representation invariance analysis approach based on data augmentation technology. Firstly, the original image library was expanded by data augmentation. The representation invariances of CNNs and the ventral visual stream were then studied by comparing the similarities of the corresponding layer features of CNNs and the prediction performance of visual encoding models based on functional magnetic resonance imaging (fMRI) before and after data augmentation. Our experimental results suggest that the architecture of CNNs, combinations of convolutional and fully-connected layers, developed representation invariance of CNNs. Remarkably, we found representation invariance belongs to all successive stages of the ventral visual stream. Hence, the internal correlation between CNNs and the human visual system in representation invariance was revealed. Our study promotes the advancement of invariant representation of computer vision and deeper comprehension of the representation invariance mechanism of human visual information processing.

Publisher

MDPI AG

Subject

General Neuroscience

Link

https://www.mdpi.com/2076-3425/10/9/602/pdf

Reference58 articles.

1. Pixels to Voxels: Modeling Visual Representation in the Human Brain;Agrawal;arXiv,2014

2. Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream

3. Using goal-driven deep learning models to understand sensory cortex

4. Seeing it all: Convolutional network layers map the function of the human visual system

5. Convolutional neural network-based encoding and decoding of visual object recognition in space and time

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Latest Advances in Human Brain Dynamics;Brain Sciences;2021-11-08