Affiliation:
1. Brigham Young University , Provo , USA
Abstract
Abstract
Sociophonetic data analysis involves a pipeline of processing steps to convert a raw spreadsheet of acoustic measurements to interpretable results. While most studies report the steps used in their pipeline, very few explicitly report their order in which those steps were applied. This study analyzes a dataset containing vowel formant data from 53 speakers by processing it 5,040 unique ways, each representing a different permutation of seven processing steps. To analyze the effect that an order has on the overall results, pairs of pipelines that differed only by swapping two adjacent steps were compared. The most important steps in the pipeline were when normalization happened, how outliers were detected, and when good data was excluded. This study illustrates what happens when these steps are rearranged relative to each other in order to justify and recommend the following order of operations: classifying allophones, removing outliers, normalizing, and then subsetting.
Subject
Linguistics and Language,Language and Linguistics
Reference18 articles.
1. Barreda, Santiago & Terrance M. Nearey. 2018. A regression approach to vowel normalization for missing and unbalanced data. Journal of the Acoustical Society of America 144(1). 500–520. https://doi.org/10.1121/1.5047742.
2. Becker, Kara (ed.). 2019a. The low-back-merger shift: Uniting the Canadian vowel shift, the California vowel shift, and short front vowel shifts across North America (Publication of the American Dialect society 104). Durham, NC: Duke University Press.
3. Becker, Kara. 2019b. Introduction. In Kara Becker (ed.), The low-back-merger shift: Uniting the Canadian vowel shift, the California vowel shift, and short front vowel shifts across North America (Publication of the American Dialect society 104). Durham, NC: Duke University Press.
4. Brand, James, Jen Hay, Lynn Clark, Kevin Watson & Márton Sóskuthy. 2021. Systematic co-variation of monophthongs across speakers of New Zealand English. Journal of Phonetics 88. 101096. https://doi.org/10.1016/j.wocn.2021.101096.
5. Kendall, Tyler & Charlie Farrington. 2021. The Corpus of regional African American language. Eugene, Oregon: The Online Resources for African American Language Project. Available at: http://oraal.uoregon.edu/coraal.