PHONETIC SEGMENTATION OF EMOTIONAL SPEECH WITH HMM-BASED METHODS

Published: 2010-11 Issue: 07 Volume: 24 Page: 1159-1179
ISSN: 0218-0014
Container-title: International Journal of Pattern Recognition and Artificial Intelligence
Language: en
Short-container-title: Int. J. Patt. Recogn. Artif. Intell.

Author:

MPORAS IOSIF¹, GANCHEV TODOR¹, FAKOTAKIS NIKOS¹

Affiliation:

1. Wire Communications Laboratory, Department of Electrical & Computer Engineering, University of Patras, Rion-Patras 26500, Greece

Abstract

In the present work we address the problem of phonetic segmentation of emotional speech. Investigating various traditional and recent HMM-based methods for speech segmentation, which we elaborated for the specifics of emotional speech segmentation, we demonstrate that the HMM-based method with hybrid embedded-isolated training offers advantageous segmentation accuracy, when compared to other HMM-based models used so far. The increased precision of the segmentation is a consequence of the iterative training process employed in the hybrid-training method, which refines the model parameters and the estimated phonetic boundaries taking advantage of the estimations made at previous iterations. Furthermore, we demonstrate the benefits of using purposely-built models for each target category of emotional speech, when compared to the case of one common model built solely from neutral speech. This advantage, in terms of segmentation accuracy, justifies the effort for creating and employing the purposely-built segmentation models per emotion category, since it significantly improves the overall segmentation accuracy.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001410008329

References: 25 articles.

1. ASR for emotional speech: Clarifying the issues and enhancing performance

2. A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains

3. Automatic segmentation and labeling of speech based on Hidden Markov Models

4. R. Cowie and E. Douglas-Cowie, Profound Deafness and Speech Communication, eds. K.E. Spens and G. Plant (Whurr, London, UK, 1995) pp. 510–527.

5. Emotion recognition in human-computer interaction

Cited by 3 articles.

1. A Pattern Mining Approach for Improving Speech Emotion Recognition;International Journal of Pattern Recognition and Artificial Intelligence;2022-11

2. Exploiting forced alignment of time-reversed data for improving HMM-based handwriting segmentation;Expert Systems with Applications;2019-05

3. ITERATIVE GROUP SELECTION-BASED ENHANCEMENT OF TIME-FREQUENCY MASKS FOR MISSING DATA RECOGNITION;International Journal of Pattern Recognition and Artificial Intelligence;2012-06