Automatic detection of prosodic boundaries in spontaneous speech-Reference-Cited by-同舟云学术

Automatic detection of prosodic boundaries in spontaneous speech

Published:2021-05-03 Issue:5 Volume:16 Page:e0250969
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Biron Tirza^ORCID,Baum Daniel,Freche Dominik,Matalon Nadav,Ehrmann Netanel,Weinreb Eyal,Biron David,Moses Elisha

Abstract

Automatic speech recognition (ASR) and natural language processing (NLP) are expected to benefit from an effective, simple, and reliable method to automatically parse conversational speech. The ability to parse conversational speech depends crucially on the ability to identify boundaries between prosodic phrases. This is done naturally by the human ear, yet has proved surprisingly difficult to achieve reliably and simply in an automatic manner. Efforts to date have focused on detecting phrase boundaries using a variety of linguistic and acoustic cues. We propose a method which does not require model training and utilizes two prosodic cues that are based on ASR output. Boundaries are identified using discontinuities in speech rate (pre-boundary lengthening and phrase-initial acceleration) and silent pauses. The resulting phrases preserve syntactic validity, exhibit pitch reset, and compare well with manual tagging of prosodic boundaries. Collectively, our findings support the notion of prosodic phrases that represent coherent patterns across textual and acoustic parameters.

Funder

Israel Science Foundation

Yeda Sela

Minerva Foundation

Braginsky Centre

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference85 articles.

1. Explaining the PENTA model: a reply to Arvaniti and Ladd;Y Xu;Phonology,2015

2. Intonation Units Revisited

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Voice Synthesis Improvement by Machine Learning of Natural Prosody;Sensors;2024-03-01

2. The Syntactic Pasts of Nouns Shape Their Prosodic Future: Lexico-Syntactic Effects on Position and Duration;Language and Speech;2023-07-04

3. The role of prosody and hand gestures in the perception of boundaries in speech✰;Speech Communication;2023-05

4. The role of live transcripts in synchronous online L2 classrooms: Learning outcomes and learner perceptions;Education and Information Technologies;2023-04-18