An algorithm for learning phonological classes from distributional similarity

Published: 2020-02 Issue: 1 Volume: 37 Page: 91-131
ISSN: 0952-6757
Container-title: Phonology
Language: en
Short-container-title: Phonology

Author:

Mayer, Connor

Abstract

An important question in phonology is to what degree the learner uses distributional information rather than substantive properties of speech sounds when learning phonological structure. This paper presents an algorithm that learns phonological classes from only distributional information: the contexts in which sounds occur. The input is a segmental corpus, and the output is a set of phonological classes. The algorithm is first tested on an artificial language, with both overlapping and nested classes reflected in the distribution, and retrieves the expected classes, performing well as distributional noise is added. It is then tested on four natural languages. It distinguishes between consonants and vowels in all cases, and finds more detailed, language-specific structure. These results improve on past approaches, and are encouraging, given the paucity of the input. More refined models may provide additional insight into which phonological classes are apparent from the distributions of sounds in natural languages.
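As a rough illustration of what "distributional information: the contexts in which sounds occur" can mean in practice, the sketch below counts the immediate left and right neighbours of each segment in a toy corpus and compares segments by cosine similarity. This is not the algorithm from the paper: the abstract does not specify the context representation or the clustering procedure, so the context window, the similarity measure, the toy corpus, and the names `context_vectors` and `cosine` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(corpus):
    """Map each segment to counts of its (side, neighbour) contexts.

    Assumption: one character per segment, '#' marks a word boundary.
    """
    vectors = defaultdict(Counter)
    for word in corpus:
        padded = ["#"] + list(word) + ["#"]
        for i in range(1, len(padded) - 1):
            seg = padded[i]
            vectors[seg][("prev", padded[i - 1])] += 1
            vectors[seg][("next", padded[i + 1])] += 1
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical toy corpus of CVCV words (not data from the paper).
toy_corpus = ["pata", "tapi", "kipa", "tika", "paki", "kati"]
vecs = context_vectors(toy_corpus)

sim_vowels = cosine(vecs["a"], vecs["i"])  # two vowels
sim_mixed = cosine(vecs["a"], vecs["p"])   # a vowel and a consonant
print(sim_vowels > sim_mixed)
```

In this toy corpus the two vowels share contexts while a vowel and a consonant share none, so even this crude measure separates consonants from vowels. Recovering overlapping or nested classes, as the abstract describes, would need more than pairwise similarity.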

Publisher

Cambridge University Press (CUP)

Subject

Linguistics and Language, Language and Linguistics

References: 80 articles.

1. Learning phonotactic distributions

2. Estimating the Dimension of a Model

3. Contextual word similarity and estimation from sparse data

4. DATA CLUSTERING

5. Beguš, Gašper (2018a). Unnatural phonology: a synchrony-diachrony interface approach. PhD dissertation, Harvard University.

Cited by 2 articles.

1. Comprehension and production of Kinyarwanda verbs in the Discriminative Lexicon. Linguistics, 2023-11-10.

2. Using hidden Markov models to find discrete targets in continuous sociophonetic data. Linguistics Vanguard, 2021-01-01.