Effectiveness of deep learning techniques in TV programs classification: A comparative analysis-Reference-Cited by-同舟云学术

Effectiveness of deep learning techniques in TV programs classification: A comparative analysis

Published:2024-07-31 Issue:4 Volume:31 Page:439-453
ISSN:1069-2509
Container-title:Integrated Computer-Aided Engineering
language:
Short-container-title:ICA

Author:

Candela Federico¹,Giordano Angelo¹,Zagaria Carmen Francesca²,Morabito Francesco Carlo³

Affiliation:

1. Dipartimento di Ingegneria dell’ Informazione, dell’ Infrastrutture e dell’ Energia Sostenibile (DIIES) Department, University Mediterranea, Reggio Calabria, Italy

2. Regional Communication Committee Calabria, Reggio Calabria, Italy

3. Dipartimento Ingegneria Civile, Energia, Ambiente e Materiali (DICEAM), University Mediterranea, Reggio Calabria, Italy

Abstract

In the application areas of streaming, social networks, and video-sharing platforms such as YouTube and Facebook, along with traditional television systems, programs’ classification stands as a pivotal effort in multimedia content management. Despite recent advancements, it remains a scientific challenge for researchers. This paper proposes a novel approach for television monitoring systems and the classification of extended video content. In particular, it presents two distinct techniques for program classification. The first one leverages a framework integrating Structural Similarity Index Measurement and Convolutional Neural Network, which pipelines on stacked frames to classify program initiation, conclusion, and contents. Noteworthy, this versatile method can be seamlessly adapted across various systems. The second analyzed framework implies directly processing optical flow. Building upon a shot-boundary detection technique, it incorporates background subtraction to adaptively discern frame alterations. These alterations are subsequently categorized through the integration of a Transformers network, showcasing a potential advancement in program classification methodology. A comprehensive overview of the promising experimental results yielded by the two techniques is reported. The first technique achieved an accuracy of 95%, while the second one surpassed it with an even higher accuracy of 87% on multiclass classification. These results underscore the effectiveness and reliability of the proposed frameworks, and pave the way for a more efficient and precise content management in the ever-evolving landscape of multimedia platforms and streaming services.

Publisher

IOS Press

Reference30 articles.

1. Agcom.2008, Available from: https://wwwagcom, it/ documents/10179/539063/Allegato+12-11-2008+13.

2. Television programs classification via deep learning approach using SSMI-CNN;Candela;In: Proceedings of the Second International Conference on Applied Intelligence and Informatics (AII 2022),2022

3. Learning spatiotemporal features with 3D convolutional networks;Tran;In: Proceedings of the IEEE International Conference on Computer Vision,2015

4. Two-stream convolutional networks for action recognition in videos;Simonyan;In: Advances in Neural Information Processing Systems,2014

5. Wang L, Xiong Y, Wang Z, Qiao Y. Temporal segment networks: Towards good practices for deep action recognition. In: Proceedings of the European Conference on Computer Vision (ECCV). 2016.