Abstract
AbstractThe field of “BERTology” aims to locate linguistic representations in large language models (LLMs). These have commonly been interpreted as representing structural descriptions (SDs) familiar from theoretical linguistics, such as abstract phrase-structures. However, it is unclear how such claims should be interpreted in the first place. This paper identifies six possible readings of “linguistic representation” from philosophical and linguistic literature, concluding that none has a straight-forward application to BERTology. In philosophy, representations are typically analyzed as cognitive vehicles individuated by intentional content. This clashes with a prevalent mentalist interpretation of linguistics, which treats SDs as (narrow) properties of cognitive vehicles themselves. I further distinguish between three readings of both kinds, and discuss challenges each brings for BERTology. In particular, some readings would make it trivially false to assign representations of SDs to LLMs, while others would make it trivially true. I illustrate this with the concrete case study of structural probing: a dominant model-interpretation technique. To improve the present situation, I propose that BERTology should adopt a more “LLM-first” approach instead of relying on pre-existing linguistic theories developed for orthogonal purposes.
Funder
Kulttuurin ja Yhteiskunnan Tutkimuksen Toimikunta
Publisher
Springer Science and Business Media LLC
Subject
General Social Sciences,Philosophy
Reference112 articles.
1. Adger, D. (2022). What are linguistic representations? Mind & Language, 37(2), 248–260.
2. Behme, C. (2015). Is the ontology of biolinguistics coherent? Language Sciences, 47, 32–42.
3. Belinkov, Y., & Glass, J. (2019). Analysis methods in neural language processing: A survey. Transactions of the Association for Computational Linguistics, 7, 49–72.
4. Benacerraf, P. (1973). Mathematical truth. Journal of Philosophy, 70(19), 661–679.
5. Blaho, S. (2007). The syntax of phonology: A radically substance-free approach (PhD Thesis). University of Tromsø.