Affiliation:
1. Slavonic Institute of the Albert-Ludwig - University of Freiburg , Freiburg , Germany
Abstract
Abstract
Quantitative, corpus based research on spontaneous spoken Carpathian Rusyn language can cause several data-related problems: Speakers are using ambivalent forms in different quantities, resulting in a biased data set – while a stricter data-cleaning process would lead to a large scale data loss. On top of that, polytomous categorical dependent variables are hard to analyze due to methodological limitations. This paper provides several approaches to face unbalanced and biased data sets containing variation of conjugational forms of the verb maty ‘to have’ and (po-)znaty ‘to know’ in Carpathian Rusyn language. Using resampling based methods like Cross-Validation, Bootstrapping and Random Forests, we provide a strategy for circumventing possible methodological pitfalls and gaining the most information from our precious data, without trying to p-hack the results. Calculating the predictive power of several sociolinguistic factors on linguistic variation, we can make valid statements about the (sociolinguistic) status of Rusyn and the stability of the old dialect continuum of Rusyn varieties.
Subject
Linguistics and Language,Language and Linguistics,Linguistics and Language,Language and Linguistics
Reference30 articles.
1. [1] Auer, P., and Hinskens, F. (1996). Convergence and Divergence of Dialects in Europe. In Sociolinguistica (10).10.1515/9783110245158.1
2. [2] Woolhiser, C. (2005). Political borders and dialect divergence/convergence in Europe. P. Auer, F. Hinskens and P. Kerswill (eds.). Dialect change: Convergence and divergence in european languages. Cambridge, pages 236–262.
3. [3] RStudio Team. (2020). RStudio: Integrated Development for R. RStudio, PBC, Boston, MA. Accessible at: http://www.rstudio.com/.
4. [4] Magocsi, P. R. (2015). With Their Backs to the Mountains: A History of Carpathian Rus’ and Carpatho-Rusyns. Budapest.10.1515/9789633861073
5. [5] H. A. Skrypnyk (ed.). (2013). Ukrajinci-Rusyny: Etnolinhvistyčni ta etnokul’turni procesy v istoryčnomu rozvytku. Kyjiv.