QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features-Reference-Cited by-同舟云学术

QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features

Published:2024-01-22 Issue:5 Volume:20 Page:1-23
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Bingöl Gülnaziye¹^ORCID,Porcu Simone¹^ORCID,Floris Alessandro¹^ORCID,Atzori Luigi¹^ORCID

Affiliation:

1. DIEE, University of Cagliari, Italy and CNIT, University of Cagliari, Italy

Abstract

The utilization of user’s facial- and speech-related features for the estimation of the Quality of Experience (QoE) of multimedia services is still underinvestigated despite its potential. Currently, only the use of either facial or speech features individually has been proposed, and relevant limited experiments have been performed. To advance in this respect, in this study, we focused on WebRTC-based videoconferencing, where it is often possible to capture both the facial expressions and vocal speech characteristics of the users. First, we performed thorough statistical analysis to identify the most significant facial- and speech-related features for QoE estimation, which we extracted from the participants’ audio-video data collected during a subjective assessment. Second, we trained individual QoE estimation machine learning-based models on the separated facial and speech datasets. Finally, we employed data fusion techniques to combine the facial and speech datasets into a single dataset to enhance the QoE estimation performance due to the integrated knowledge provided by the fusion of facial and speech features. The obtained results demonstrate that the data fusion technique based on the Improved Centered Kernel Alignment (ICKA) allows for reaching a mean QoE estimation accuracy of 0.93, whereas the values of 0.78 and 0.86 are reached when using only facial or speech features, respectively.

Funder

European Union under the Italian National Recovery and Resilience Plan (NRRP) of NextGenerationEU

Sustainable Mobility Center

Centro Nazionale per la Mobilit Sostenibile, CNMS

Dottorati e contratti di ricerca su tematiche dell” innovazione

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3638251

Reference61 articles.

1. QoE assessment of interactive applications in computer networks

2. L. Amour, M. I. Boulabiar, S. Souihi, and A. Mellouk. 2018. An improved QoE estimation method based on QoS and affective computing. In International Symposium on Programming and Systems (ISPS’18). 1–6.

3. T. Baltrus̆aitis, M. Mahmoud, and P. Robinson. 2015. Cross-dataset learning and person-specific normalisation for automatic Action Unit detection. In 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG’15), Vol. 06. 1–6.

4. T. Baltrus̆aitis, A. Zadeh, Y. C. Lim, and L. Morency. 2018. OpenFace 2.0: Facial behavior analysis toolkit. In 13th IEEE International Conference on Automatic Face Gesture Recognition (FG’18). 59–66.

5. Survey of research on Quality of experience modelling for web browsing;Barakovic Sabina;Qual. User Exper.,2017

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evolving Feature Selection: Synergistic Backward and Forward Deletion Method Utilizing Global Feature Importance;IEEE Access;2024