Singing Voice Detection in Opera Recordings: A Case Study on Robustness and Generalization-Reference-Cited by-同舟云学术

Singing Voice Detection in Opera Recordings: A Case Study on Robustness and Generalization

Published:2021-05-20 Issue:10 Volume:10 Page:1214
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Krause Michael^ORCID,Müller Meinard^ORCID,Weiß Christof^ORCID

Abstract

Automatically detecting the presence of singing in music audio recordings is a central task within music information retrieval. While modern machine-learning systems produce high-quality results on this task, the reported experiments are usually limited to popular music and the trained systems often overfit to confounding factors. In this paper, we aim to gain a deeper understanding of such machine-learning methods and investigate their robustness in a challenging opera scenario. To this end, we compare two state-of-the-art methods for singing voice detection based on supervised learning: A traditional approach relying on hand-crafted features with a random forest classifier, as well as a deep-learning approach relying on convolutional neural networks. To evaluate these algorithms, we make use of a cross-version dataset comprising 16 recorded performances (versions) of Richard Wagner’s four-opera cycle Der Ring des Nibelungen. This scenario allows us to systematically investigate generalization to unseen versions, musical works, or both. In particular, we study the trained systems’ robustness depending on the acoustic and musical variety, as well as the overall size of the training dataset. Our experiments show that both systems can robustly detect singing voice in opera recordings even when trained on relatively small datasets with little variety.

Funder

Deutsche Forschungsgemeinschaft

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/10/1214/pdf

Reference32 articles.

1. An Introduction to Signal Processing for Singing-Voice Analysis: High Notes in the Effort to Automate the Understanding of Vocals in Music

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Study of Chinese Peking Opera Arithmetic Coding in Tan Dun’s Opera Marco Polo and the Art of Mixed Sexuality;Applied Mathematics and Nonlinear Sciences;2024-01-01

2. Analysis of the use of pop singing in musical theater singing based on data analysis;Applied Mathematics and Nonlinear Sciences;2023-10-30

3. Hierarchical Classification for Instrument Activity Detection in Orchestral Music Recordings;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2023

4. Singing Voice Detection in Electronic Music with a Long-Term Recurrent Convolutional Network;Applied Sciences;2022-07-23

5. Hierarchical Classification of Singing Activity, Gender, and Type in Complex Music Recordings;ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2022-05-23