Abstract
This Element introduces a usage-based computational approach to Construction Grammar that draws on techniques from natural language processing and unsupervised machine learning. This work explores how to represent constructions, how to learn constructions from a corpus, and how to arrange the constructions in a grammar as a network. From a theoretical perspective, this Element examines how construction grammars emerge from usage alone as complex systems, with slot-constraints learned at the same time that constructions are learned. From a practical perspective, this work is accompanied by a Python package which enables linguists to incorporate construction grammars into their own corpus-based work. The computational experiments in this Element are important for testing the learnability, variability, and confirmability of Construction Grammar as a theory of language. All code examples will leverage the cloud computing platform Code Ocean to guide readers through implementation of these algorithms.
Publisher
Cambridge University Press