Author:
Yan Xuguo,Xia Xuhui,Wang Lei,Zhang Zelin
Abstract
Deep neural networks (DNNs) require large amounts of labeled data for model training. However, label noise is a common problem in datasets due to the difficulty of classification and high cost of labeling processes. Introducing the concepts of curriculum learning and progressive learning, this paper presents a novel solution that is able to handle massive noisy labels and improve model generalization ability. It proposes a new network model training strategy that considers mislabeled samples directly in the network training process. The new learning curriculum is designed to measures the complexity of the data with their distribution density in a feature space. The sample data in each category are then divided into easy-to-classify (clean samples), relatively easy-to-classify, and hard-to-classify (noisy samples) subsets according to the smallest intra-class local density with each cluster. On this basis, DNNs are trained progressively in three stages, from easy to hard, i.e., from clean to noisy samples. The experimental results demonstrate that the accuracy of image classification can be improved through data augmentation, and the classification accuracy of the proposed method is clearly higher than that of standard Inception_v2 for the NEU dataset after data augmentation, when the proportion of noisy labels in the training set does not exceed 60%. With 50% noisy labels in the training set, the classification accuracy of the proposed method outperformed recent state-of-the-art label noise learning methods, CleanNet and MentorNet. The proposed method also performed well in practical applications, where the number of noisy labels was uncertain and unevenly distributed. In this case, the proposed method not only can alleviate the adverse effects of noisy labels, but it can also improve the generalization ability of standard deep networks and their overall capability.
Funder
National Natural Science Foundation of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference31 articles.
1. Applying a Deep Learning Convolutional Neural Network (CNN) Approach for Building a Face Recognition System: A Review;Liyew;J. Emerg. Technol. Innov. Res.,2017
2. Survey on the use of CNN and Deep Learning in Image Classification;Dani;J. Emerg. Technol. Innov. Res.,2021
3. Yoo, J., Lee, C.H., Jea, H.M., Lee, S.K., Yoon, Y., Lee, J., and Hwang, S.U. (2022). Classification of Road Surfaces Based on CNN Architecture and Tire Acoustical Signals. Appl. Sci., 12.
4. Alghamdi, H.S. (2022). Towards Explainable Deep Neural Networks for the Automatic Detection of Diabetic Retinopathy. Appl. Sci., 12.
5. A hybrid deep learning CNN-ELM model and its application in handwritten numeral recognition;Guo;J. Comput. Inf. Syst.,2015