Label noise and self-learning label correction in cardiac abnormalities classification-Reference-Cited by-同舟云学术

Label noise and self-learning label correction in cardiac abnormalities classification

Published:2022-09-05 Issue:9 Volume:43 Page:094001
ISSN:0967-3334
Container-title:Physiological Measurement
language:
Short-container-title:Physiol. Meas.

Author:

Gallego Vázquez Cristina^ORCID,Breuss Alexander^ORCID,Gnarra Oriella^ORCID,Portmann Julian^ORCID,Madaffari Antonio^ORCID,Da Poian Giulia^ORCID

Abstract

Abstract Objective. Learning to classify cardiac abnormalities requires large and high-quality labeled datasets, which is a challenge in medical applications. Small datasets from various sources are often aggregated to meet this requirement, resulting in a final dataset prone to label noise due to inter- and intra-observer variability and different expertise. It is well known that label noise can affect the performance and generalizability of the trained models. In this work, we explore the impact of label noise and self-learning label correction on the classification of cardiac abnormalities on large heterogeneous datasets of electrocardiogram (ECG) signals. Approach. A state-of-the-art self-learning multi-class label correction method for image classification is adapted to learn a multi-label classifier for electrocardiogram signals. We evaluated our performance using 5-fold cross-validation on the publicly available PhysioNet/Computing in Cardiology (CinC) 2021 Challenge data, with full and reduced sets of leads. Due to the unknown label noise in the testing set, we tested our approach on the MNIST dataset. We investigated the performance under different levels of structured label noise for both datasets. Main results. Under high levels of noise, the cross-validation results of self-learning label correction show an improvement of approximately 3% in the challenge score for the PhysioNet/CinC 2021 Challenge dataset and an improvement in accuracy of 5% and reduction of the expected calibration error of 0.03 for the MNIST dataset. We demonstrate that self-learning label correction can be used to effectively deal with the presence of unknown label noise, also when using a reduced number of ECG leads.

Funder

Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung

Publisher

IOP Publishing

Subject

Physiology (medical),Biomedical Engineering,Physiology,Biophysics

Link

https://iopscience.iop.org/article/10.1088/1361-6579/ac89cb/pdf

Reference31 articles.

1. Classification of 12-lead ECGs: the PhysioNet-Computing in Cardiology Challenge 2020;Alday;Physiol. Meas.,2020

2. A Survey of Methods for Detection and Correction of Noisy Labels in Time Series Data

3. The MNIST database of handwritten digit images for machine learning research;Deng;IEEE Signal Process Mag.,2012