Affiliation:
1. Department of English, French and German , Facultade de Filoloxía e Tradución, Universidade de Vigo , 36310 Vigo , Spain
2. Institut für Anglistik/Amerikanistik, Universität Rostock , Rostock , Germany
Abstract
Abstract
Following the quantitative turn in linguistics, the field appears to be in a methodological “wild west” state where much is possible and new frontiers are being explored, but there is relatively little guidance in terms of firm rules or conventions. In this article, we focus on the issue of variable selection in regression modeling. It is common to aim for a “minimal adequate model” and eliminate “non-significant” variables by statistical procedures. We advocate an alternative, “deductive modeling” approach that retains a “full” model of variables generated from our research questions and objectives. Comparing the statistical model to a camera, i.e., a tool to produce an image of reality, we contrast the deductive and predictive (minimal) modeling approaches on a dataset from a corpus study. While a minimal adequate model is more parsimonious, its selection procedure is blind to the research aim and may conceal relevant information. Deductive models, by contrast, are grounded in theory, have higher transparency (all relevant variables are reported) and potentially a greater accuracy of the reported effects. They are useful for answering research questions more directly, as they rely explicitly on prior knowledge and hypotheses, and allow for estimation and comparison across datasets.
Subject
Linguistics and Language,Language and Linguistics
Reference72 articles.
1. Agresti, Alan. 2002. Categorical data analysis. Hoboken, NJ: Wiley.
2. Baayen, R. Harald. 2008. Analyzing linguistic data. A practical introduction to statistics using R. Cambridge: Cambridge University Press.
3. Baayen, R. Harald. 2013. Multivariate statistics. In Robert J. Podesva & Devyani Sharma (eds.), Research methods in linguistics, 337–372. Cambridge: Cambridge University Press.
4. Baayen, Harald R., Laura A. Janda, Tore Nesset, Endresen Anna & Anastasia Makarova. 2013. Making choices in Russian: Pros and cons of statistical methods for rival forms. Russian Linguistics 37(3). 253–291. https://doi.org/10.1007/s11185-013-9118-6.
5. Barr, Dale J., Roger Levy, Christoph Scheepers & Harry J. Tily. 2013. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language 68. 255–278. https://doi.org/10.1016/j.jml.2012.11.001.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献