Using token-based semantic vector spaces for corpus-linguistic analyses: From practical applications to tests of theoretical claims-Reference-Cited by-同舟云学术

Using token-based semantic vector spaces for corpus-linguistic analyses: From practical applications to tests of theoretical claims

Published:2017-09-26 Issue:0 Volume:0 Page:
ISSN:1613-7027
Container-title:Corpus Linguistics and Linguistic Theory
language:
Short-container-title:

Author:

Hilpert Martin¹,Correia Saavedra David¹

Affiliation:

1. Department of English, Université de Neuchâtel, Neuchâtel, Switzerland

Abstract

AbstractThis paper presents token-based semantic vector spaces as a tool that can be applied in corpus-linguistic analyses such as word sense comparisons, comparisons of synonymous lexical items, and matching of concordance lines with a given text. We demonstrate how token-based semantic vector spaces are created, and we illustrate the kinds of result that can be obtained with this approach. Our main argument is that token-based semantic vector spaces are not only useful for practical corpus-linguistic applications but also for the investigation of theory-driven questions. We illustrate this point with a discussion of the asymmetric priming hypothesis (Jäger and Rosenbach 2008). The asymmetric priming hypothesis, which states that grammaticalizing constructions will be primed by their lexical sources but not vice versa, makes a number of empirically testable predictions. We operationalize and test these predictions, concluding that token-based semantic vector spaces yield conclusions that are relevant for linguistic theory-building.

Publisher

Walter de Gruyter GmbH

Subject

Linguistics and Language,Language and Linguistics

Link

https://www.degruyter.com/downloadpdf/journals/cllt/ahead-of-print/article-10.1515-cllt-2017-0009/article-10.1515-cllt-2017-0009.xml

Reference52 articles.

1. Mapping meaning with distributional methods. A diachronic corpus-based study of existential there;Journal of Historical Linguistics,2013

Cited by 38 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Could this be next for corpus linguistics? Methods of semi-automatic data annotation with contextualized word embeddings;Linguistics Vanguard;2024-06-25

2. Corpus linguistics meets historical linguistics and construction grammar: how far have we come, and where do we go from here?;Corpus Linguistics and Linguistic Theory;2024-03-25

3. In search of methodological standards for corpus-based cognitive semantics: The case of Behavioral Profiles;Studia Neophilologica;2024-01-31

4. Meaning differences between English clippings and their source words: A corpus-based study;ICAME Journal;2023-05-01

5. 9 Modals in the network model of Construction Grammar;Models of Modals;2023-04-12