Medial temporal cortex supports compositional visual inferences-Reference-Cited by-同舟云学术

Medial temporal cortex supports compositional visual inferences

Published:2023-09-08 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Bonnen Tyler^ORCID,Wagner Anthony D.^ORCID,Yamins Daniel L.K.^ORCID

Abstract

Perception unfolds across multiple timescales. For humans and other primates, many object-centric visual attributes can be inferred ‘at a glance’ (i.e., with<200ms of visual information), an ability supported by ventral temporal cortex (VTC). Other perceptual inferences require more time; to determine a novel object’s identity, we might need to represent its unique configuration of visual features, requiring multiple ‘glances.’ Here we evaluate whether medial temporal cortex (MTC), downstream from VTC, supports object perception by integrating over such visuospatial sequences. We first compare human visual inferences directly to electrophysiological recordings from macaque VTC. While human performance ‘at a glance’ is approximated by a linear readout of VTC, participants radically outperform VTC given longer viewing times (i.e.,>200ms). Next, we demonstrate the causal role of MTC in these temporally extended visual inferences: just as time restricted performance can be approximated by a linear readout of VTC, the performance of (time unrestricted) MTC-lesioned humans resembles a computational proxy for VTC. Finally, we characterize these visual abilities through a series of eyetracking experiments. With extended viewing times participants sequentially sample task-relevant features via multiple saccades—visuospatial patterns that are reliable across participants and necessary for performance. From these data, we suggest that MTC transforms visuospatial sequences into ‘compositional’ representations that support visual object perception.

Publisher

Cold Spring Harbor Laboratory

Reference72 articles.

1. Progress and limitations of deep networks to recognize objects in unusual poses;Proceedings of the AAAI Conference on Artificial Intelligence,2023

2. Alcorn, M. A. , Li, Q. , Gong, Z. , Wang, C. , Mai, L. , Ku, W.-S. , & Nguyen, A. (2019). Strike (with) a pose: Neural networks are easily fooled by strange poses of familiar objects. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 4845–4854.

3. Seeing faces is necessary for face-domain formation

4. Deep convolutional networks do not classify based on global object shape;PLoS computational biology,2018

5. The human medial temporal lobe processes online representations of complex objects