Deep convolutional models improve predictions of macaque V1 responses to natural images-Reference-Cited by-同舟云学术

Deep convolutional models improve predictions of macaque V1 responses to natural images

Published:2017-10-11 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Cadena Santiago A.,Denfield George H.,Walker Edgar Y.^ORCID,Gatys Leon A.,Tolias Andreas S.,Bethge Matthias,Ecker Alexander S.

Abstract

AbstractDespite great efforts over several decades, our best models of primary visual cortex (V1) still predict spiking activity quite poorly when probed with natural stimuli, highlighting our limited understanding of the nonlinear computations in V1. Recently, two approaches based on deep learning have been successfully applied to neural data: On the one hand, transfer learning from networks trained on object recognition worked remarkably well for predicting neural responses in higher areas of the primate ventral stream, but has not yet been used to model spiking activity in early stages such as V1. On the other hand, data-driven models have been used to predict neural responses in the early visual system (retina and V1) of mice, but not primates. Here, we test the ability of both approaches to predict spiking activity in response to natural images in V1 of awake monkeys. Even though V1 is rather at an early to intermediate stage of the visual system, we found that the transfer learning approach performed similarly well to the data-driven approach and both outperformed classical linear-nonlinear and wavelet-based feature representations that build on existing theories of V1. Notably, transfer learning using a pre-trained feature space required substantially less experimental time to achieve the same performance. In conclusion, multi-layer convolutional neural networks (CNNs) set the new state of the art for predicting neural responses to natural images in primate V1 and deep features learned for object recognition are better explanations for V1 computation than all previous filter bank theories. This finding strengthens the necessity of V1 models that are multiple nonlinearities away from the image domain and it supports the idea of explaining early visual cortex based on high-level functional goals.Author summaryPredicting the responses of sensory neurons to arbitrary natural stimuli is of major importance for understanding their function. Arguably the most studied cortical area is primary visual cortex (V1), where many models have been developed to explain its function. However, the most successful models built on neurophysiologists’ intuitions still fail to account for spiking responses to natural images. Here, we model spiking activity in primary visual cortex (V1) of monkeys using deep convolutional neural networks (CNNs), which have been successful in computer vision. We both trained CNNs directly to fit the data, and used CNNs trained to solve a high-level task (object categorization). With these approaches, we are able to outperform previous models and improve the state of the art in predicting the responses of early visual neurons to natural images. Our results have two important implications. First, since V1 is the result of several nonlinear stages, it should be modeled as such. Second, functional models of entire visual pathways, of which V1 is an early stage, do not only account for higher areas of such pathways, but also provide useful representations for V1 predictions.

Publisher

Cold Spring Harbor Laboratory

Reference71 articles.

1. Do We Know What the Early Visual System Does?

2. Receptive fields of single neurones in the cat's striate cortex

3. Receptive fields and functional architecture of monkey striate cortex

4. An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex

5. Half-squaring in responses of cat striate cells

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Do Topographic Deep ANN Models of the Primate Ventral Stream Predict the Perceptual Effects of Direct IT Cortical Interventions?;2024-01-09

2. Biophysical neural adaptation mechanisms enable deep learning models to capture dynamic retinal computation;2023-06-24

3. Generalization in data-driven models of primary visual cortex;2020-10-07

4. Topographic deep artificial neural networks reproduce the hallmarks of the primate inferior temporal cortex face processing network;2020-07-10

5. Toward the Next Generation of Retinal Neuroprosthesis: Visual Computation with Spikes;Engineering;2020-04