Abstract
We present a biologically inspired recurrent neural network (RNN) that efficiently detects changes in natural images. The model features sparse, topographic connectivity (st-RNN), closely modeled on the circuit architecture of a “midbrain attention network.” We deployed the st-RNN in a challenging change blindness task, in which changes must be detected in a discontinuous sequence of images. Compared with a conventional RNN, the st-RNN learned 9x faster and achieved state-of-the-art performance with 15x fewer connections. An analysis of low-dimensional dynamics revealed putative circuit mechanisms, including a critical role for a global inhibitory (GI) motif, for successful change detection. The model reproduced key experimental phenomena, including midbrain neurons' sensitivity to dynamic stimuli, neural signatures of stimulus competition, as well as hallmark behavioral effects of midbrain microstimulation. Finally, the model accurately predicted human gaze fixations in a change blindness experiment, surpassing state-of-the-art saliency-based methods. The st-RNN provides a novel deep learning model for linking neural computations underlying change detection with psychophysical mechanisms.SIGNIFICANCE STATEMENTFor adaptive survival, our brains must be able to accurately and rapidly detect changing aspects of our visual world. We present a novel deep learning model, a sparse, topographic recurrent neural network (st-RNN), that mimics the neuroanatomy of an evolutionarily conserved “midbrain attention network.” The st-RNN achieved robust change detection in challenging change blindness tasks, outperforming conventional RNN architectures. The model also reproduced hallmark experimental phenomena, both neural and behavioral, reported in seminal midbrain studies. Lastly, the st-RNN outperformed state-of-the-art models at predicting human gaze fixations in a laboratory change blindness experiment. Our deep learning model may provide important clues about key mechanisms by which the brain efficiently detects changes.
Funder
Department of Biotechnology, Ministry of Science and Technology, India
DST | Science and Engineering Research Board
CSIR Ph.D. Fellowship
Reference100 articles.
1. Abadi M , Barham P , Chen J , Chen Z , Davis A (2016) Tensorflow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation. ACM. Available at https://dl.acm.org/doi/10.5555/3026877.3026899 .
2. Achanta R , Hemami S , Estrada F , Susstrunk S (2009) Frequency-tuned salient region detection. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, pp 1597–1604. IEEE. Available at https://doi.org/10.1109/CVPR.2009.5206596 .
3. A Model of the Superior Colliculus Predicts Fixation Locations during Scene Viewing and Visual Search
4. Unraveling causal mechanisms of top-down and bottom-up visuospatial attention with non-invasive brain stimulation;Banerjee;J Indian Inst Sci,2019
5. Functional, molecular and morphological heterogeneity of superficial interneurons in the larval zebrafish tectum;Barker;J Comp Neurol,2021
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献