Deep Recurrent Neural Network Reveals a Hierarchy of Process Memory during Dynamic Natural Vision

Published: 2017-08-17

Author:

Shi Junxing, Wen Haiguang, Zhang Yizhen, Han Kuan, Liu Zhongming

Abstract

The human visual cortex extracts both spatial and temporal visual features to support perception and guide behavior. Deep convolutional neural networks (CNNs) provide a computational framework to model cortical representation and organization for spatial visual processing, but they are unable to explain how the brain processes temporal information. To overcome this limitation, we extended a CNN by adding recurrent connections to different layers of the CNN to allow spatial representations to be remembered and accumulated over time. The extended model, or the recurrent neural network (RNN), embodied a hierarchical and distributed model of process memory as an integral part of visual processing. Unlike the CNN, the RNN learned spatiotemporal features from videos to enable action recognition. The RNN predicted cortical responses to natural movie stimuli better than the CNN in all visual areas, especially those along the dorsal stream. As a fully observable model of visual processing, the RNN also revealed a cortical hierarchy of temporal receptive windows, dynamics of process memory, and spatiotemporal representations. These results support the hypothesis of process memory and demonstrate the potential of using the RNN for in-depth computational understanding of dynamic natural vision.
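
The idea of letting a layer's spatial representation be "remembered and accumulated over time" can be illustrated with a very small sketch. The code below is not the authors' model: the abstract does not give the recurrent update rule, so a fixed scalar gate and a leaky-accumulation update are assumed here purely for illustration, and the names `accumulate` and `gate` are hypothetical. It shows how a smaller gate keeps a feature in memory for more frames, which is one simple way to think about layers having different temporal receptive windows.

```python
def accumulate(features, gate):
    """Leaky accumulation of per-frame feature vectors (illustrative only).

    features: list of per-frame feature vectors (lists of floats)
    gate: float in (0, 1]; fraction of the new frame mixed into memory.
          This fixed scalar is an assumption, not the paper's learned gating.
    """
    memory = [0.0] * len(features[0])
    states = []
    for frame in features:
        # Blend the previous memory with the current frame's features.
        memory = [(1.0 - gate) * m + gate * x for m, x in zip(memory, frame)]
        states.append(list(memory))
    return states


# An impulse: a feature is present in the first frame only.
impulse = [[1.0]] + [[0.0]] * 4

fast = accumulate(impulse, gate=0.9)  # memory fades quickly
slow = accumulate(impulse, gate=0.2)  # memory persists longer

print("fast:", [round(s[0], 5) for s in fast])
print("slow:", [round(s[0], 5) for s in slow])
```

With `gate=1.0` the memory simply copies each frame, which corresponds to a purely feedforward response with no accumulation; lowering the gate trades responsiveness for a longer trace of past frames.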

Publisher

Cold Spring Harbor Laboratory

Cited by 3 articles.

1. Approximating the Architecture of Visual Cortex in a Convolutional Network;Neural Computation;2019-08

2. Variational Autoencoder: An Unsupervised Model for Modeling and Decoding fMRI Activity in Visual Cortex;2017-11-05

3. Activations of Deep Convolutional Neural Network are Aligned with Gamma Band Activity of Human Visual Cortex;2017-05-03