Learning with noisy labels via clean-aware sharpness-aware minimization


Huang Bin1, Zhang Ping1, Xie Ying1, Xu Chaoyang1


1. Putian University



Learning with noisy labels has attracted considerable attention because it makes it possible to leverage large amounts of inexpensive but imprecisely labeled data. Sharpness-aware minimization (SAM) improves generalization in the presence of noisy labels by introducing adversarial weight perturbations in the model parameter space. However, our experimental observations show that the generalization bottleneck of SAM stems primarily from the difficulty of finding the correct adversarial perturbation amid noisy data. To address this problem, we theoretically analyze the mismatch in parameter perturbation direction between noisy and clean samples during training. Based on this analysis, we propose a clean-aware sharpness-aware minimization algorithm, CA-SAM. CA-SAM dynamically divides the training data into likely clean and likely noisy subsets based on historical model outputs and uses the likely clean samples to determine the direction of the parameter perturbation. While searching for flat minima in the loss landscape, it aims to restrict the gradient perturbation direction of noisy samples, aligning it with that of the clean samples while preserving the clean samples. Comprehensive experiments on benchmark datasets with diverse noise patterns and noise levels demonstrate that CA-SAM outperforms several recent approaches by a substantial margin.
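The idea described in the abstract, computing the SAM perturbation from likely clean samples only, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the logistic-regression model, the small-mean-historical-loss selection rule, and the names `split_likely_clean`, `clean_aware_sam_step`, `keep_ratio`, `rho`, and `lr` are all assumptions made for illustration. The abstract states only that the split is based on historical model output.

```python
import numpy as np

def per_sample_loss_and_grad(w, X, y):
    """Per-sample logistic loss and its gradient w.r.t. w (toy model, assumed)."""
    p = 1.0 / (1.0 + np.exp(-(X @ w)))
    tiny = 1e-12
    loss = -(y * np.log(p + tiny) + (1 - y) * np.log(1 - p + tiny))
    grad = (p - y)[:, None] * X  # shape (n_samples, n_features)
    return loss, grad

def split_likely_clean(loss_history, keep_ratio=0.5):
    """Assumed rule: samples with the smallest mean loss over past epochs
    are treated as likely clean. loss_history has shape (n_epochs, n_samples)."""
    mean_loss = loss_history.mean(axis=0)
    k = max(1, int(keep_ratio * mean_loss.size))
    mask = np.zeros(mean_loss.size, dtype=bool)
    mask[np.argsort(mean_loss)[:k]] = True
    return mask

def clean_aware_sam_step(w, X, y, clean_mask, lr=0.1, rho=0.05):
    """One SAM-style update whose perturbation uses likely clean samples only."""
    # 1) Ascent direction from the likely clean subset.
    _, g = per_sample_loss_and_grad(w, X[clean_mask], y[clean_mask])
    g_clean = g.mean(axis=0)
    eps = rho * g_clean / (np.linalg.norm(g_clean) + 1e-12)
    # 2) Gradient of the full-batch loss at the perturbed weights.
    _, g_pert = per_sample_loss_and_grad(w + eps, X, y)
    # 3) Descent step applied to the original weights.
    return w - lr * g_pert.mean(axis=0), eps
```

In a training loop one would append each epoch's per-sample losses to `loss_history`, recompute the mask, and call `clean_aware_sam_step` per batch. Standard SAM corresponds to passing an all-`True` mask, so the only difference in this sketch is which samples define the perturbation.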


Springer Science and Business Media LLC








