A mathematical theory of relational generalization in transitive inference

Authors:

Samuel Lippl (1,2,3), Kenneth Kay (1,2,4), Greg Jensen (1,3,5), Vincent P. Ferrera (1,3,6), L. F. Abbott (1,2,3)

Affiliation:

1. Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, NY 10027

2. Center for Theoretical Neuroscience, Department of Neuroscience, Columbia University, New York, NY 10027

3. Department of Neuroscience, Columbia University Medical Center, New York, NY 10032

4. Grossman Center for the Statistics of Mind, Columbia University, New York, NY 10027

5. Department of Psychology, Reed College, Portland, OR 97202

6. Department of Psychiatry, Columbia University Medical Center, New York, NY 10032

Abstract

Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g., gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar “conjunctivity factor” determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the “rich regime,” which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.
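As a minimal sketch of the abstract's first claim (not code from the paper; the item count, the concatenated one-hot encoding, and the +1/-1 labels are illustrative assumptions), the following Python example trains a minimum-norm linear readout, the lambda-to-zero limit of ridge regression, on only the adjacent premise pairs of an ordered item set, using an item-wise additive representation. It then checks that every novel, non-adjacent pair is ranked transitively.

```python
import numpy as np

n_items = 7  # ordered items A > B > ... > G (illustrative choice)

def encode(i, j, n=n_items):
    """Item-wise additive encoding: concatenated one-hots of the two items."""
    x = np.zeros(2 * n)
    x[i] = 1.0      # item presented first
    x[n + j] = 1.0  # item presented second
    return x

# Premise pairs: only adjacent items, presented in both orders.
X, y = [], []
for i in range(n_items - 1):
    X.append(encode(i, i + 1)); y.append(+1.0)  # e.g. "A > B"
    X.append(encode(i + 1, i)); y.append(-1.0)  # reversed presentation
X, y = np.array(X), np.array(y)

# Minimum-norm least-squares readout (the lambda -> 0 limit of ridge regression).
w = np.linalg.pinv(X) @ y

# Transitive generalization: every novel (non-adjacent) pair is ranked correctly.
for i in range(n_items):
    for j in range(n_items):
        if abs(i - j) <= 1:
            continue  # skip identical and trained pairs
        out = w @ encode(i, j)
        assert np.sign(out) == np.sign(j - i), (i, j, out)
print("All novel pairs ranked transitively.")
```

Under this encoding the minimum-norm solution reduces to a single scalar rank per item, so the readout's output for a pair is the difference of the two ranks. Its magnitude therefore grows with the distance between items in the order, consistent with the symbolic-distance effect, one of the signature behavioral patterns the abstract refers to.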

Funder

NSF NeuroNex

Gatsby Charitable Foundation

NIH

NIMH K99

Simons Collaboration on the Global Brain

Publisher

Proceedings of the National Academy of Sciences
