Hyperspectral Image Classification with Imbalanced Data Based on Semi-Supervised Learning-Reference-Cited by-同舟云学术

Hyperspectral Image Classification with Imbalanced Data Based on Semi-Supervised Learning

Published:2022-04-13 Issue:8 Volume:12 Page:3943
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Zheng Xiaorou^ORCID,Jia Jianxin,Chen Jinsong,Guo Shanxin,Sun Luyi^ORCID,Zhou Chan,Wang Yawei

Abstract

Hyperspectral remote sensing image classification has been widely employed for numerous applications, such as environmental monitoring, agriculture, and mineralogy. During such classification, the number of training samples in each class often varies significantly. This imbalance in the dataset is often not identified because most classifiers are designed under a balanced dataset assumption, which can distort the minority classes or even treat them as noise. This may lead to biased and inaccurate classification results. This issue can be alleviated by applying preprocessing techniques that enable a uniform distribution of the imbalanced data for further classification. However, it is difficult to add new natural features to a training model by artificial combination of samples by using existing preprocessing techniques. For minority classes with sparse samples, the addition of sufficient natural features can effectively alleviate bias and improve the generalization. For such an imbalanced problem, semi-supervised learning is a creative solution that utilizes the rich natural features of unlabeled data, which can be collected at a low cost in the remote sensing classification. In this paper, we propose a novel semi-supervised learning-based preprocessing solution called NearPseudo. In NearPseudo, pseudo-labels are created by the initialization classifier and added to minority classes with the corresponding unlabeled samples. Simultaneously, to increase reliability and reduce the misclassification cost of pseudo-labels, we created a feedback mechanism based on a consistency check to effectively select the unlabeled data and its pseudo-labels. Experiments were conducted on a state-of-the-art representative hyperspectral dataset to verify the proposed method. The experimental results demonstrate that NearPseudo can achieve better classification accuracy than other common processing methods. Furthermore, it can be flexibly applied to most typical classifiers to improve their classification accuracy. With the intervention of NearPseudo, the accuracy of random forest, k-nearest neighbors, logistic regression, and classification and regression tree increased by 1.8%, 4.0%, 6.4%, and 3.7%, respectively. This study addresses research a gap to solve the imbalanced data-based limitations in hyperspectral image classification.

Funder

Strategic Priority Research Program of the Chi- 530 nese Academy of Sciences

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/8/3943/pdf

Reference57 articles.

1. Diverse Region-Based CNN for Hyperspectral Image Classification

2. Hyperspectral Image Classification With Imbalanced Data Based on Orthogonal Complement Subspace Projection

3. Imbalanced Hyperspectral Image Classification Based on Maximum Margin

4. Training- and Test-Time Data Augmentation for Hyperspectral Image Segmentation

5. Learning from class-imbalanced data: Review of methods and applications

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A deep convolutional neural network for the classification of imbalanced breast cancer dataset;Healthcare Analytics;2024-06

2. Transfer Learning-Based Hyperspectral Image Classification Using Residual Dense Connection Networks;Sensors;2024-04-23

3. Deep learning techniques for hyperspectral image analysis in agriculture: A review;ISPRS Open Journal of Photogrammetry and Remote Sensing;2024-04

4. An extensive review of hyperspectral image classification and prediction: techniques and challenges;Multimedia Tools and Applications;2024-03-09

5. Two-Stream Networks for Contrastive Learning in Hyperspectral Image Classification;IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing;2024