Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations

Published: 2022-07-18 Issue: 3 Volume: 2 Page: 347-364
ISSN: 2673-9585
Container-title: Knowledge
Language: en
Short-container-title: Knowledge

Author:

Kotsakis Rigas, Dimoulas Charalampos

Abstract

The present paper focuses on adaptive audio detection, segmentation and classification techniques in audio broadcasting content, dedicated mainly to voice data. The suggested framework addresses a real-case scenario encountered in media services, especially radio streams, aiming to fulfill diverse (semi-)automated indexing/annotation and management needs. In this context, aggregated radio content is collected, featuring small input datasets, which are utilized for adaptive classification experiments, without searching, at this point, for a generic pattern recognition solution. Hierarchical and hybrid taxonomies are proposed, first to discriminate voice data in radio streams and then to detect single-speaker voices; when a single speaker is detected, the experiments proceed to a final layer of gender classification. Stand-alone and combined supervised and clustering techniques are tested along with multivariate window tuning, towards the extraction of meaningful results based on overall and partial performance rates. Furthermore, through data augmentation mechanisms, the current work contributes to the formulation of a dynamic Generic Audio Classification Repository to be subjected, in the future, to adaptive multilabel experimentation with more sophisticated techniques, such as deep architectures.
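
The three-layer hierarchy described in the abstract (voice detection, then single-speaker detection, then gender classification) can be pictured as a simple cascade. The sketch below is only an illustration of that control flow and is not the authors' implementation: the function names, the three-value feature layout and every threshold are hypothetical placeholders, and in practice each stage would be a trained supervised or clustering model.

```python
from typing import Callable, Sequence

Features = Sequence[float]          # hypothetical layout: [voicing, overlap, pitch_hz]
Stage = Callable[[Features], bool]  # one binary decision per hierarchy layer


def classify_segment(features: Features, is_voice: Stage,
                     is_single_speaker: Stage, is_male: Stage) -> str:
    """Walk the hierarchy top-down and stop at the first layer that fails."""
    if not is_voice(features):
        return "non-voice"
    if not is_single_speaker(features):
        return "voice/not-single-speaker"
    gender = "male" if is_male(features) else "female"
    return f"voice/single-speaker/{gender}"


# Toy stand-ins for trained classifiers; the thresholds are arbitrary
# assumptions chosen for illustration, not values from the paper.
def toy_is_voice(f: Features) -> bool:
    return f[0] > 0.5


def toy_is_single_speaker(f: Features) -> bool:
    return f[1] < 0.3


def toy_is_male(f: Features) -> bool:
    return f[2] < 165.0


def classify_with_toy_rules(features: Features) -> str:
    """Convenience wrapper that plugs the toy stages into the cascade."""
    return classify_segment(features, toy_is_voice,
                            toy_is_single_speaker, toy_is_male)


if __name__ == "__main__":
    print(classify_with_toy_rules([0.9, 0.1, 120.0]))
```

Passing the stages as callables keeps the cascade independent of the models behind it, so a stand-alone classifier and a combined supervised/clustering scheme could be swapped in at any layer without touching the routing logic.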

Publisher

MDPI AG

Link

https://www.mdpi.com/2673-9585/2/3/20/pdf

References: 30 articles.

1. Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification

2. Moderation Techniques for Social Media Content; Veglis, 2014

3. Only overlay text: novel features for TV news broadcast video segmentation

4. Emotional Prediction and Content Profile Estimation in Evaluating Audiovisual Mediated Communication

5. Continuous Speech Emotion Recognition with Convolutional Neural Networks

Cited by 2 articles.

1. Exploring music listening patterns: an online survey; International Journal of Electronics and Telecommunications; 2024-06-25

2. “Give me happy pop songs in C major and with a fast tempo”: A vocal assistant for content-based queries to online music repositories; International Journal of Human-Computer Studies; 2023-05