Jazz Bass Transcription Using a U-Net Architecture-Reference-Cited by-同舟云学术

Jazz Bass Transcription Using a U-Net Architecture

Published:2021-03-12 Issue:6 Volume:10 Page:670
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Abeßer Jakob,Müller Meinard^ORCID

Abstract

In this paper, we adapt a recently proposed U-net deep neural network architecture from melody to bass transcription. We investigate pitch shifting and random equalization as data augmentation techniques. In a parameter importance study, we study the influence of the skip connection strategy between the encoder and decoder layers, the data augmentation strategy, as well as of the overall model capacity on the system’s performance. Using a training set that covers various music genres and a validation set that includes jazz ensemble recordings, we obtain the best transcription performance for a downscaled version of the reference algorithm combined with skip connections that transfer intermediate activations between the encoder and decoder. The U-net based method outperforms previous knowledge-driven and data-driven bass transcription algorithms by around five percentage points in overall accuracy. In addition to a pitch estimation improvement, the voicing estimation performance is clearly enhanced.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/6/670/pdf

Reference30 articles.

1. A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. From Music Scores to Audio Recordings: Deep Pitch-Class Representations for Measuring Tonal Structures;Journal on Computing and Cultural Heritage;2024-07-31

2. Locally Activated Gated Neural Network for Automatic Music Genre Classification;Applied Sciences;2023-04-17

3. Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2022

4. Intelligent Audio Signal Processing – Do We Still Need Annotated Datasets?;Intelligent Information and Database Systems;2022

5. Machine Learning Applied to Music/Audio Signal Processing;Electronics;2021-12-10