Active Vision in Binocular Depth Estimation: A Top-Down Perspective-Reference-Cited by-同舟云学术

Active Vision in Binocular Depth Estimation: A Top-Down Perspective

Published:2023-09-21 Issue:5 Volume:8 Page:445
ISSN:2313-7673
Container-title:Biomimetics
language:en
Short-container-title:Biomimetics

Author:

Priorelli Matteo¹^ORCID,Pezzulo Giovanni²^ORCID,Stoianov Ivilin Peev¹^ORCID

Affiliation:

1. Institute of Cognitive Sciences and Technologies, National Research Council of Italy, 35137 Padova, Italy

2. Institute of Cognitive Sciences and Technologies, National Research Council of Italy, 00185 Rome, Italy

Abstract

Depth estimation is an ill-posed problem; objects of different shapes or dimensions, even if at different distances, may project to the same image on the retina. Our brain uses several cues for depth estimation, including monocular cues such as motion parallax and binocular cues such as diplopia. However, it remains unclear how the computations required for depth estimation are implemented in biologically plausible ways. State-of-the-art approaches to depth estimation based on deep neural networks implicitly describe the brain as a hierarchical feature detector. Instead, in this paper we propose an alternative approach that casts depth estimation as a problem of active inference. We show that depth can be inferred by inverting a hierarchical generative model that simultaneously predicts the eyes’ projections from a 2D belief over an object. Model inversion consists of a series of biologically plausible homogeneous transformations based on Predictive Coding principles. Under the plausible assumption of a nonuniform fovea resolution, depth estimation favors an active vision strategy that fixates the object with the eyes, rendering the depth belief more accurate. This strategy is not realized by first fixating on a target and then estimating the depth; instead, it combines the two processes through action–perception cycles, with a similar mechanism of the saccades during object recognition. The proposed approach requires only local (top-down and bottom-up) message passing, which can be implemented in biologically plausible neural circuits.

Publisher

MDPI AG

Subject

Molecular Medicine,Biomedical Engineering,Biochemistry,Biomaterials,Bioengineering,Biotechnology

Link

https://www.mdpi.com/2313-7673/8/5/445/pdf

Reference50 articles.

1. Binocular disparity and the perception of depth;Qian;Neuron,1997

2. Binocular depth perception and the cerebral cortex;Parker;Nat. Rev. Neurosci.,2007

3. Anterior Regions of Monkey Parietal Cortex Process Visual 3D Shape;Durand;Neuron,2007

4. 3D shape perception from combined depth cues in human visual cortex;Welchman;Nat. Neurosci.,2005

5. Depth cues, rather than perceived depth, govern vergence;Wismeijer;Exp. Brain Res.,2008

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning and embodied decisions in active inference;2024-08-21

2. Embodied decisions as active inference;2024-06-01

3. Modeling motor control in continuous-time Active Inference: a survey;IEEE Transactions on Cognitive and Developmental Systems;2024

4. Deep kinematic inference affords efficient and scalable control of bodily movements;Proceedings of the National Academy of Sciences;2023-12-12