Automatic Label Calibration for Singing Annotation Using Fully Convolutional Neural Network-Reference-Cited by-同舟云学术

Automatic Label Calibration for Singing Annotation Using Fully Convolutional Neural Network

Published:2023-04-04 Issue:6 Volume:18 Page:945-952
ISSN:1931-4973
Container-title:IEEJ Transactions on Electrical and Electronic Engineering
language:en
Short-container-title:IEEJ Transactions Elec Engng

Author:

Fu Xiao¹,Deng Hangyu¹,Hu Jinglu¹

Affiliation:

1. Graduate School of Information, Production and Systems Waseda University 2–7, Hibikino, Kitakyushu Fukuoka 808–0135 Japan

Abstract

Accurately‐labeled data is crucial for the training of machine learning models. For singing‐related tasks in the music information retrieval field, accurately‐labeled data is limited because annotating singing is time‐consuming. Several studies create vocal datasets using a two‐step annotation method which creates coarse labels first and then executes a manual calibration procedure. However, manually calibrating coarsely‐labeled singing data is expensive and time‐consuming. To address this problem, in this study we propose a singing‐label calibration framework, which aims to automatically calibrate the coarsely‐labeled singing data with higher accuracy. This framework contains a data augmentation method to generate training and testing data, a reasonable data preprocessing method to handle music audio and symbolic labels, a fully‐convolutional neural network to estimate the difference between coarse labels and accurate labels, and a novel calibration function to correct the coarse labels. Various experiments are conducted to examine the effect of our research. The results show that our model can highly reduce the cost time and slightly increase the labeling accuracy of the manual calibration process. © 2023 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.

Publisher

Wiley

Subject

Electrical and Electronic Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/tee.23804

Reference20 articles.

1. NishikimiR NakamuraE FukayamaS GotoM YoshiiK.Automatic singing transcription based on encoder‐decoder recurrent neural networks with a weakly‐supervised attention mechanism. inICASSP2019–2019 IEEE International Conference on Acoustics Speech and Signal Processing(ICASSP). IEEE 2019;161–165.

2. FuX YuanX HuJ.Hsd: A hierarchical singing annotation dataset.2022 IEEE International Symposium on Multimedia(ISM) 2022;245–246.

3. J.‐Y.WangandJ.‐S. R.Jang On the preparation and validation of a largescale dataset of singing transcription.ICASSP 2021–2021 IEEE International Conference on Acoustics Speech and Signal Processing(ICASSP) 2021;276–280.

4. RyynänenM KlapuriA.Transcription of the singing melody in polyphonic music.ISMIR 2006;222–227.

5. FuZ‐S SuL.Hierarchical classification networks for singing voice segmentation and transcription. InProceedings of the 20th International Society for Music Information Retrieval Conference(ISMIR 2019) 2019;900–907.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving piano music signal recognition through enhanced frequency domain analysis;Journal of Measurements in Engineering;2024-02-23

2. Transfer Learning Using Musical Instrument Audio for Improving Automatic Singing Label Calibration;IEEJ Transactions on Electrical and Electronic Engineering;2024-02-11

3. Development Status, Frontier Hotspots, and Technical Evaluations in the Field of AI Music Composition Since the 21st Century: A Systematic Review;IEEE Access;2024