Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference

Author:

Kadiri Sudarsana Reddy, Gangamohan P., Gangashetty Suryakanth V., Alku Paavo, Yegnanarayana B.

Abstract

In the generation of emotional speech, the speech production features deviate from those of neutral (non-emotional) speech. The objective of this study is to capture the deviations in features related to the excitation component of speech and to develop a system for automatic recognition of emotions based on these deviations. The emotions considered in this study are anger, happiness, sadness and the neutral state. The study shows that the deviations of the excitation features carry useful information, which can be exploited to develop an emotion recognition system. The excitation features used in this study are the instantaneous fundamental frequency ($F_0$), the strength of excitation, the energy of excitation and the ratio of the high-frequency to low-frequency band energy ($\beta$). A hierarchical binary decision tree approach is used to develop an emotion recognition system with neutral speech as reference. The recognition experiments showed that the excitation features are comparable to or better than the existing prosody features and spectral features, such as mel-frequency cepstral coefficients, perceptual linear predictive coefficients and modulation spectral features.
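As a rough illustration of the band-energy ratio ($\beta$) and of measuring a feature's deviation from a neutral reference, the following minimal sketch may help. It is not the authors' implementation: the band boundary `split_hz`, the FFT-based energy estimate, and the z-score form of the deviation are all assumptions made for this example, since the abstract does not specify them.

```python
import numpy as np


def band_energy_ratio(frame, sample_rate, split_hz=2000.0):
    """Ratio of high-band to low-band spectral energy for one frame.

    split_hz is a hypothetical band boundary chosen for illustration;
    the abstract does not state which bands the study uses.
    """
    spectrum = np.abs(np.fft.rfft(frame)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    low = spectrum[freqs < split_hz].sum()
    high = spectrum[freqs >= split_hz].sum()
    # Small constant avoids division by zero on silent frames.
    return high / (low + 1e-12)


def deviation_from_neutral(value, neutral_mean, neutral_std):
    """Deviation of a feature value from neutral-speech statistics.

    A z-score is assumed here purely as one plausible way to express
    "deviation with neutral speech as reference".
    """
    return (value - neutral_mean) / (neutral_std + 1e-12)


if __name__ == "__main__":
    sr = 16000
    t = np.arange(1600) / sr
    low_tone = np.sin(2 * np.pi * 500 * t)    # energy below split_hz
    high_tone = np.sin(2 * np.pi * 4000 * t)  # energy above split_hz
    print(band_energy_ratio(low_tone, sr) < 1.0)
    print(band_energy_ratio(high_tone, sr) > 1.0)
```

In a hierarchical binary decision scheme, deviations of this kind could feed a sequence of two-class decisions (for example, neutral versus emotional first), though the ordering of those decisions is not given in the abstract.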

Funder

The Academy of Finland

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Signal Processing

Cited by 25 articles.

1. Epoch extraction in real-world scenario;International Journal of Speech Technology;2024-09

2. On the Use of Pitch-Based Features for Detecting Simultaneous Fear Emotion and Deception Behavior From Speech;International Journal of Pattern Recognition and Artificial Intelligence;2024-06-29

3. Development of a method for recognizing emotions from a speech signal;Proceedings of the Southwest State University. Series: IT Management, Computer Science, Computer Engineering. Medical Equipment Engineering;2024-06-28

4. Hierarchical speech emotion recognition using the valence-arousal model;Multimedia Tools and Applications;2024-06-14

5. The Sound of Emotional Prosody: Nearly 3 Decades of Research and Future Directions;Perspectives on Psychological Science;2024-01-17
