Assessment of Machine Learning-Based Audiovisual Quality Predictors-Reference-Cited by-同舟云学术

Assessment of Machine Learning-Based Audiovisual Quality Predictors

Published:2021-05-31 Issue:2 Volume:17 Page:1-22
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

K. Mythili¹,Narwaria Manish²^ORCID

Affiliation:

1. BITS Pilani Hyderabad Campus

2. Indian Institute of Technology Jodhpur, India

Abstract

Quality assessment of audiovisual (AV) signals is important from the perspective of system design, optimization, and management of a modern multimedia communication system. However, automatic prediction of AV quality via the use of computational models remains challenging. In this context, machine learning (ML) appears to be an attractive alternative to the traditional approaches. This is especially when such assessment needs to be made in no-reference (i.e., the original signal is unavailable) fashion. While development of ML-based quality predictors is desirable, we argue that proper assessment and validation of such predictors is also crucial before they can be deployed in practice. To this end, we raise some fundamental questions about the current approach of ML-based model development for AV quality assessment and signal processing for multimedia communication in general. We also identify specific limitations associated with the current validation strategy which have implications on analysis and comparison of ML-based quality predictors. These include a lack of consideration of: (a) data uncertainty, (b) domain knowledge, (c) explicit learning ability of the trained model, and (d) interpretability of the resultant model. Therefore, the primary goal of this article is to shed some light into mentioned factors. Our analysis and proposed recommendations are of particular importance in the light of significant interests in ML methods for multimedia signal processing (specifically in cases where human-labeled data is used), and a lack of discussion of mentioned issues in existing literature.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3430376

Reference46 articles.

1. Audio-Visual Multimedia Quality Assessment: A Comprehensive Survey

2. Benjamin Belmudez. 2015. Audiovisual Quality Assessment and Prediction for Videotelephony. DOI:https://doi.org/10.1007/978-3-319-14166-4 Benjamin Belmudez. 2015. Audiovisual Quality Assessment and Prediction for Videotelephony. DOI:https://doi.org/10.1007/978-3-319-14166-4

3. Random search for hyper-parameter optimization;Bergstra James;J. Mach. Learn. Res.,2012

4. Machine Learning Interpretability: A Survey on Methods and Metrics