Effectiveness of deep learning techniques in TV programs classification: A comparative analysis

Author:

Candela Federico1,Giordano Angelo1,Zagaria Carmen Francesca2,Morabito Francesco Carlo3

Affiliation:

1. Dipartimento di Ingegneria dell’ Informazione, dell’ Infrastrutture e dell’ Energia Sostenibile (DIIES) Department, University Mediterranea, Reggio Calabria, Italy

2. Regional Communication Committee Calabria, Reggio Calabria, Italy

3. Dipartimento Ingegneria Civile, Energia, Ambiente e Materiali (DICEAM), University Mediterranea, Reggio Calabria, Italy

Abstract

In the application areas of streaming, social networks, and video-sharing platforms such as YouTube and Facebook, along with traditional television systems, programs’ classification stands as a pivotal effort in multimedia content management. Despite recent advancements, it remains a scientific challenge for researchers. This paper proposes a novel approach for television monitoring systems and the classification of extended video content. In particular, it presents two distinct techniques for program classification. The first one leverages a framework integrating Structural Similarity Index Measurement and Convolutional Neural Network, which pipelines on stacked frames to classify program initiation, conclusion, and contents. Noteworthy, this versatile method can be seamlessly adapted across various systems. The second analyzed framework implies directly processing optical flow. Building upon a shot-boundary detection technique, it incorporates background subtraction to adaptively discern frame alterations. These alterations are subsequently categorized through the integration of a Transformers network, showcasing a potential advancement in program classification methodology. A comprehensive overview of the promising experimental results yielded by the two techniques is reported. The first technique achieved an accuracy of 95%, while the second one surpassed it with an even higher accuracy of 87% on multiclass classification. These results underscore the effectiveness and reliability of the proposed frameworks, and pave the way for a more efficient and precise content management in the ever-evolving landscape of multimedia platforms and streaming services.

Publisher

IOS Press

Reference30 articles.

1. Agcom.2008, Available from: https://wwwagcom, it/ documents/10179/539063/Allegato+12-11-2008+13.

2. Television programs classification via deep learning approach using SSMI-CNN;Candela;In: Proceedings of the Second International Conference on Applied Intelligence and Informatics (AII 2022),2022

3. Learning spatiotemporal features with 3D convolutional networks;Tran;In: Proceedings of the IEEE International Conference on Computer Vision,2015

4. Two-stream convolutional networks for action recognition in videos;Simonyan;In: Advances in Neural Information Processing Systems,2014

5. Wang L, Xiong Y, Wang Z, Qiao Y. Temporal segment networks: Towards good practices for deep action recognition. In: Proceedings of the European Conference on Computer Vision (ECCV). 2016.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3