LaplaceConfidence: A graph-based approach for learning with noisy labels

Published:2024-04-12 Page:1-17
ISSN:1088-467X
Container-title:Intelligent Data Analysis
Short-container-title:IDA

Author:

Chen Mingcai¹,Du Yuntao¹,Tang Wei²,Zhang Baoming¹,Wang Chongjun¹

Affiliation:

1. State Key Laboratory for Novel Software Technology at Nanjing University, Nanjing University, Nanjing, Jiangsu, China

2. Department of Neurology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands

Abstract

Real-world machine learning applications seldom provide perfectly labeled data, posing a challenge in developing models robust to noisy labels. Recent methods prioritize noise filtering based on the discrepancies between model predictions and the provided noisy labels, assuming that samples with minimal classification losses are clean. In this work, we capitalize on the consistency between the learned model and the complete noisy dataset, employing the data’s rich representational and topological information. We introduce LaplaceConfidence, a method that obtains label confidence (i.e., clean probabilities) using the Laplacian energy. Specifically, it first constructs graphs based on the feature representations of all noisy samples and minimizes the Laplacian energy to produce a low-energy graph. Clean labels should fit well into the low-energy graph while noisy ones should not, allowing our method to determine each sample’s clean probability. Furthermore, LaplaceConfidence is embedded into a holistic method for robust training, where a co-training technique generates unbiased label confidence and a label refurbishment technique makes better use of it. We also explore a dimensionality reduction technique to scale our method to large noisy datasets. Our experiments demonstrate that LaplaceConfidence outperforms state-of-the-art methods on benchmark datasets under both synthetic and real-world noise. Code is available at https://github.com/chenmc1996/LaplaceConfidence.
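
The graph-based idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the repository linked above for that); it assumes an unweighted symmetric k-nearest-neighbor graph, the combinatorial Laplacian L = D - W, and a standard regularized smoothing objective tr(FᵀLF) + mu·‖F − Y‖², where Y holds the one-hot noisy labels. The function names and the parameters `k` and `mu` are hypothetical choices made for illustration only.

```python
import numpy as np


def knn_adjacency(features, k=5):
    """Unweighted, symmetric k-nearest-neighbor adjacency (illustrative choice)."""
    n = features.shape[0]
    sq = np.sum(features ** 2, axis=1)
    # Pairwise squared Euclidean distances.
    d2 = sq[:, None] + sq[None, :] - 2.0 * features @ features.T
    np.fill_diagonal(d2, np.inf)  # a sample is never its own neighbor
    neighbors = np.argsort(d2, axis=1)[:, :k]
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), k), neighbors.ravel()] = 1.0
    return np.maximum(W, W.T)  # symmetrize


def laplacian_energy(W, F):
    """tr(F^T L F), equal to 0.5 * sum_ij W_ij * ||F_i - F_j||^2 for symmetric W."""
    L = np.diag(W.sum(axis=1)) - W
    return float(np.trace(F.T @ L @ F))


def label_confidence(features, noisy_labels, num_classes, k=5, mu=1.0):
    """Smooth the noisy labels over the graph and read off per-sample confidence.

    Assumed objective (not taken from the paper):
        minimize tr(F^T L F) + mu * ||F - Y||^2,
    whose minimizer solves (L + mu * I) F = mu * Y.
    """
    W = knn_adjacency(features, k)
    L = np.diag(W.sum(axis=1)) - W
    Y = np.eye(num_classes)[noisy_labels]  # one-hot noisy labels
    F = np.linalg.solve(L + mu * np.eye(len(Y)), mu * Y)
    # Confidence = smoothed score of the label each sample was given.
    conf = np.clip(F[np.arange(len(Y)), noisy_labels], 0.0, 1.0)
    return conf, F, W
```

In this sketch a sample whose given label disagrees with most of its graph neighbors ends up with a low smoothed score for that label, which is the sense in which a noisy label does not fit a low-energy graph. How the paper builds its graphs, solves the minimization, and combines the result with co-training and label refurbishment may differ from these assumptions.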

Publisher

IOS Press

Reference:45 articles.

1. A Survey on Data Collection for Machine Learning: A Big Data – AI Integration Perspective;Roh;IEEE Transactions on Knowledge and Data Engineering,2021

2. B. Han, Q. Yao, T. Liu, G. Niu, I.W. Tsang, J.T. Kwok and M. Sugiyama, A Survey of Label-noise Representation Learning: Past, Present and Future, CoRR abs/2011.04406 (2020). https://arxiv.org/abs/2011.04406.

3. A brief introduction to weakly supervised learning;Zhou;National Science Review,2018

4. A closer look at memorization in deep networks;Arpit;International Conference on Machine Learning,2017

5. Mentornet: Learning data-driven curriculum for very deep neural networks on corrupted labels;Jiang;International Conference on Machine Learning,2018