Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations

Published: 2020-06-17

Author:

Dapello Joel, Marques Tiago, Schrimpf Martin, Geiger Franziska, Cox David D., DiCarlo James J.

Abstract

Current state-of-the-art object recognition models are largely based on convolutional neural network (CNN) architectures, which are loosely inspired by the primate visual system. However, these CNNs can be fooled by imperceptibly small, explicitly crafted perturbations, and struggle to recognize objects in corrupted images that are easily recognized by humans. Here, by making comparisons with primate neural data, we first observed that CNN models with a neural hidden layer that better matches primate primary visual cortex (V1) are also more robust to adversarial attacks. Inspired by this observation, we developed VOneNets, a new class of hybrid CNN vision models. Each VOneNet contains a fixed weight neural network front-end that simulates primate V1, called the VOneBlock, followed by a neural network back-end adapted from current CNN vision models. The VOneBlock is based on a classical neuroscientific model of V1: the linear-nonlinear-Poisson model, consisting of a biologically-constrained Gabor filter bank, simple and complex cell nonlinearities, and a V1 neuronal stochasticity generator. After training, VOneNets retain high ImageNet performance, but each is substantially more robust, outperforming the base CNNs and state-of-the-art methods by 18% and 3%, respectively, on a conglomerate benchmark of perturbations comprised of white box adversarial attacks and common image corruptions. Finally, we show that all components of the VOneBlock work in synergy to improve robustness. While current CNN architectures are arguably brain-inspired, the results presented here demonstrate that more precisely mimicking just one stage of the primate visual system leads to new gains in ImageNet-level computer vision applications.
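
The linear-nonlinear-Poisson structure named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' VOneBlock: the function names, the kernel size, wavelength, and envelope width are all hypothetical values chosen for illustration rather than biologically constrained ones, and the noise step assumes one common approximation of Poisson-like variability (Gaussian noise whose variance equals the mean response).

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def gabor_kernel(size, wavelength, theta, phase, sigma):
    """Build a size x size Gabor patch (hypothetical parameterisation; size should be odd)."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    # Rotate coordinates so the sinusoidal carrier varies along orientation theta.
    x_rot = x * np.cos(theta) + y * np.sin(theta)
    y_rot = -x * np.sin(theta) + y * np.cos(theta)
    envelope = np.exp(-(x_rot ** 2 + y_rot ** 2) / (2.0 * sigma ** 2))
    carrier = np.cos(2.0 * np.pi * x_rot / wavelength + phase)
    return envelope * carrier


def filter_response(image, kernel):
    """Linear stage: 'valid' cross-correlation of a 2-D image with a 2-D kernel."""
    windows = sliding_window_view(image, kernel.shape)
    return np.einsum("ijkl,kl->ij", windows, kernel)


def v1_front_end(image, rng, n_orientations=4, size=9,
                 wavelength=4.0, sigma=2.0, noise=True):
    """Fixed-weight front end: Gabor bank, simple/complex nonlinearities, noise."""
    channels = []
    for theta in np.linspace(0.0, np.pi, n_orientations, endpoint=False):
        # Quadrature pair: same orientation, phases 90 degrees apart.
        even = filter_response(image, gabor_kernel(size, wavelength, theta, 0.0, sigma))
        odd = filter_response(image, gabor_kernel(size, wavelength, theta, np.pi / 2, sigma))
        simple = np.maximum(even, 0.0)              # simple cell: half-wave rectification
        complex_ = np.sqrt(even ** 2 + odd ** 2)    # complex cell: quadrature energy
        channels.extend([simple, complex_])
    rates = np.stack(channels)
    if noise:
        # Assumed Poisson-like variability: Gaussian noise with variance equal to the mean.
        rates = rates + rng.normal(size=rates.shape) * np.sqrt(rates)
    return rates


rng = np.random.default_rng(0)
image = rng.random((32, 32))
responses = v1_front_end(image, rng)
print(responses.shape)
```

With a 32 x 32 input, a 9 x 9 kernel, and four orientations, the output has two channels per orientation (one simple, one complex) over a 24 x 24 grid. In a hybrid model of the kind the abstract describes, an array like this would feed a trainable CNN back-end while the filter weights stay fixed.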

Publisher

Cold Spring Harbor Laboratory

References: 101 articles.

1. “Going Deeper with Convolutions”, 2014

2. “Very Deep Convolutional Networks for Large-Scale Image Recognition”, 2014

3. “Deep Residual Learning for Image Recognition”, 2015

Cited by 46 articles.

1. Towards human-leveled vision systems;Science China Technological Sciences;2024-07-30

2. Convolutional networks can model the functional modulation of MEG responses during reading;2024-05-30

3. Convolutional networks can model the functional modulation of MEG responses during reading;2024-05-30

4. Synergy Exploration in Deploying Convolutional Neural Networks Across Distributed Neuromorphic System;2024 IEEE International Instrumentation and Measurement Technology Conference (I2MTC);2024-05-20

5. Informing Machine Perception With Psychophysics;Proceedings of the IEEE;2024-02