Author:
Lior Gili, Shalev Yuval, Stanovsky Gabriel, Goldstein Ariel
Abstract
The human brain is an adaptive learning system that can generalize to new tasks and unfamiliar environments. The traditional view is that such adaptive behavior requires a structural change of the learning system (e.g., via neural plasticity). In this work, we use artificial neural networks, specifically large language models (LLMs), to challenge the traditional view about the role of plasticity in learning and to suggest that such adaptive behavior can be achieved solely through computation if the learning system is sufficiently trained. We focus on statistical learning paradigms, which require identifying underlying regularities in seemingly arbitrary word sequences and are widely considered to require neural plasticity. We show that LLMs can capture such arbitrary structures without weight adaptation, despite the divergence of these sequences from their natural language training data. Our work provides novel insights into the role of plasticity in learning, showing that sufficiently trained learning systems are far more flexible than previously acknowledged, adapting to new tasks and environments solely through computation. Furthermore, our work opens the door for future research to use deep learning models to conjure hypotheses about the brain.
Publisher
Cold Spring Harbor Laboratory