Abstract
For many tasks such as retrieving a previously viewed object, an observer must form a representation of the world at one location and use it at another. A world-based three-dimensional reconstruction of the scene built up from visual information would fulfil this requirement, something computer vision now achieves with great speed and accuracy. However, I argue that it is neither easy nor necessary for the brain to do this. I discuss biologically plausible alternatives, including the possibility of avoiding three-dimensional coordinate frames such as ego-centric and world-based representations. For example, the distance, slant and local shape of surfaces dictate the propensity of visual features to move in the image with respect to one another as the observer's perspective changes (through movement or binocular viewing). Such propensities can be stored without the need for three-dimensional reference frames. The problem of representing a stable scene in the face of continual head and eye movements is an appropriate starting place for understanding the goal of three-dimensional vision, more so, I argue, than the case of a static binocular observer.
This article is part of the themed issue ‘Vision in our three-dimensional world’.
Funder
Engineering and Physical Sciences Research Council
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology
Cited by
10 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Virtual Reality for Vision Science;Virtual Reality in Behavioral Neuroscience: New Insights and Methods;2023
2. Understanding 3D vision as a policy network;Philosophical Transactions of the Royal Society B: Biological Sciences;2022-12-13
3. Adaptive cognitive maps for curved surfaces in the 3D world;Cognition;2022-08
4. V1 as an egocentric cognitive map;Neuroscience of Consciousness;2021-09-01
5. Adaptive cognitive maps for curved surfaces in the 3D world;2021-09-01