Biologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision

Published: 2021-11-22 Volume: 15
ISSN: 1662-5188
Container-title: Frontiers in Computational Neuroscience
Short-container-title: Front. Comput. Neurosci.

Author:

Lukanov Hristofor, König Peter, Pipa Gordon

Abstract

While abundant in biology, foveated vision is nearly absent from computational models and especially deep learning architectures. Despite considerable hardware improvements, training deep neural networks still presents a challenge and constrains the complexity of models. Here we propose an end-to-end neural model for foveal-peripheral vision, inspired by retino-cortical mapping in primates and humans. Our model has an efficient sampling technique for compressing the visual signal such that a small portion of the scene is perceived in high resolution while a large field of view is maintained in low resolution. An attention mechanism for performing “eye-movements” assists the agent in collecting detailed information incrementally from the observed scene. Our model achieves comparable results to a similar neural architecture trained on full-resolution data for image classification and outperforms it at video classification tasks. At the same time, because of the smaller size of its input, it can reduce computational effort tenfold and uses several times less memory. Moreover, we present an easy-to-implement bottom-up and top-down attention mechanism which relies on task-relevant features and is therefore a convenient byproduct of the main architecture. Apart from its computational efficiency, the presented work provides means for exploring active vision for agent training in simulated environments and anthropomorphic robotics.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Frontiers Media SA

Subject

Cellular and Molecular Neuroscience, Neuroscience (miscellaneous)

References: 84 articles.

1. A model of bottom-up visual attention using cortical magnification; Aboudib; 2015

2. A biologically inspired framework for visual information processing and an application on modeling bottom-up visual attention; Aboudib; Cognit. Comput., 2016

3. Learning receptor positions from imperfectly known motions; Ahumada Jr; Hum. Vis. Electron. Imaging Models Methods Appl., 1990

4. Object detection through search with a foveated visual system; Akbas; PLoS Comput. Biol., 2017

5. Deep networks for human visual attention: a hybrid model using foveal vision; Almeida; 2017

Cited by 2 articles.

1. Modeling Visual Impairments with Artificial Neural Networks: a Review; 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW); 2023-10-02

2. An overview of space-variant and active vision mechanisms for resource-constrained human inspired robotic vision; Autonomous Robots; 2023-06-09