Crossmodal and incremental perception of audiovisual cues to emotional speech-Reference-Cited by-同舟云学术

Crossmodal and incremental perception of audiovisual cues to emotional speech

Published:2010-02-22 Issue:1 Volume:53 Page:3-30
ISSN:0023-8309
Container-title:Language and Speech
language:en
Short-container-title:Lang Speech

Author:

Barkhuysen Pashiera¹,Krahmer Emiel¹,Swerts Marc²

Affiliation:

1. Tilburg University

2. Tilburg University,

Abstract

In this article we report on two experiments about the perception of audiovisual cues to emotional speech. The article addresses two questions: (1) how do visual cues from a speaker’s face to emotion relate to auditory cues, and (2) what is the recognition speed for various facial cues to emotion? Both experiments reported below are based on tests with video clips of emotional utterances collected via a variant of the well-known Velten method. More specifically, we recorded speakers who displayed positive or negative emotions, which were congruent or incongruent with the (emotional) lexical content of the uttered sentence. In order to test this, we conducted two experiments. The first experiment is a perception experiment in which Czech participants, who do not speak Dutch, rate the perceived emotional state of Dutch speakers in a bimodal (audiovisual) or a unimodal (audio- or vision-only) condition. It was found that incongruent emotional speech leads to significantly more extreme perceived emotion scores than congruent emotional speech, where the difference between congruent and incongruent emotional speech is larger for the negative than for the positive conditions. Interestingly, the largest overall differences between congruent and incongruent emotions were found for the audio-only condition, which suggests that posing an incongruent emotion has a particularly strong effect on the spoken realization of emotions.

Publisher

SAGE Publications

Subject

Speech and Hearing,Linguistics and Language,Sociology and Political Science,Language and Linguistics,General Medicine

Link

http://journals.sagepub.com/doi/pdf/10.1177/0023830909348993

Reference47 articles.

1. Recognizing Emotion from Facial Expressions: Psychological and Neurological Mechanisms

2. Multimodal markers of irony and sarcasm

3. Can we hear the prosody of smile?

4. Vocal Expression and Perception of Emotion

5. Acoustic profiles in vocal emotion expression.

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Visual channel facilitates the comprehension of the intonation of Brazilian Portuguese wh-questions and wh-exclamations: evidence from congruent and incongruent stimuli;Language and Cognition;2024-04-08

2. Bimodal Emotion Recognition Based on Vocal and Facial Features;Procedia Computer Science;2023

3. Joint modelling of audio-visual cues using attention mechanisms for emotion recognition;Multimedia Tools and Applications;2022-08-05

4. Pardo, Jennifer S., Lynne C. Nygaard, Robert E. Remez, and David B. Pisoni. 2021. The handbook of speech perception. Hoboken: Wiley Blackwell. ISBN 9781119184089 (cloth), ISBN 9781119184072 (adobe pdf);Phonetica;2022-06-01

5. Marcadores del discurso en contextos de emoción;ELUA;2022-03-21