Abstract
Self-training algorithm highlights the speed of training a supervised classifier through small labeled samples and large unlabeled samples. Despite its long considerable success, self-training algorithm has suffered from mislabeled samples. Local noise filters are designed to detect mislabeled samples. However, two major problem with this kind of application are: (a) Current local noise filters have not treated the spatial distribution of the nearest neighbors in different classes in much detail. (b) They are being disadvantaged when mislabeled samples are located in overlapping areas of different classes. Here, we develop an integrated architecture – self-training algorithm based on density peaks combining globally adaptive multi-local noise filter (STDP-GAMLNF), to improve detecting efficiency. Firstly, the spatial structure of the data set is revealed by density peak clustering, and it is used for empowering self-training to label unlabeled samples. In the meantime, after each epoch of labeling, GAMLNF can comprehensively judge whether a sample is a mislabeled sample from multiple classes or not, and it will reduce the influence of edge samples effectively. The corresponding experimental results conducted on eighteen UCI data sets demonstrate that GAMLNF is not sensitive to the value of the neighbor parameter k, and it is capable of adaptively finding the appropriate number of neighbors of each class.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science
Reference29 articles.
1. disentangled variational auto-encoder for semi-supervised learning;Li;Information Sciences,2019
2. C. Yuan et al., semi-supervised stacked autoencoder-based deep hierarchical semantic feature for real-time fingerprint liveness detection, Journal of Real-Time Image Processing 17(1) (2020), 55–71.
3. A combination of active learning and self-learning for named entity recognition on twitter using conditional random fields;Tran;Knowledge-Based Systems,2017
4. boosting semi-supervised face recognition with noise robustness;Liu;IEEE Transactions on Circuits and Systems for Video Technology,2021
5. M. Zhang et al., traditional Chinese medicine knowledge service based on semi-supervised BERT-BiLSTM-CRF model, in: 2020 International Conference on Service Science, 2020, pp. 64–69.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献