HyperSFormer: A Transformer-Based End-to-End Hyperspectral Image Classification Method for Crop Classification-Reference-Cited by-同舟云学术

HyperSFormer: A Transformer-Based End-to-End Hyperspectral Image Classification Method for Crop Classification

Published:2023-07-11 Issue:14 Volume:15 Page:3491
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Xie Jiaxing¹²³,Hua Jiajun¹,Chen Shaonan¹,Wu Peiwen¹,Gao Peng¹^ORCID,Sun Daozong¹²³,Lyu Zhendong¹,Lyu Shilei¹²³,Xue Xiuyun¹²³,Lu Jianqiang¹²³^ORCID

Affiliation:

1. College of Electronic Engineering (College of Artificial Intelligence), South China Agricultural University, Guangzhou 510642, China

2. Laboratory of Lingnan Modern Agriculture Science and Technology Guangdong Experimental Heyuan Branch, Heyuan 514000, China

3. Engineering Research Center for Monitoring Agricultural Information of Guangdong Province, Guangzhou 510642, China

Abstract

Crop classification of large-scale agricultural land is crucial for crop monitoring and yield estimation. Hyperspectral image classification has proven to be an effective method for this task. Most current popular hyperspectral image classification methods are based on image classification, specifically on convolutional neural networks (CNNs) and recurrent neural networks (RNNs). In contrast, this paper focuses on methods based on semantic segmentation and proposes a new transformer-based approach called HyperSFormer for crop hyperspectral image classification. The key enhancement of the proposed method is the replacement of the encoder in SegFormer with an improved Swin Transformer while keeping the SegFormer decoder. The entire model adopts a simple and uniform transformer architecture. Additionally, the paper introduces the hyper patch embedding (HPE) module to extract spectral and local spatial information from the hyperspectral images, which enhances the effectiveness of the features used as input for the model. To ensure detailed model processing and achieve end-to-end hyperspectral image classification, the transpose padding upsample (TPU) module is proposed for the model’s output. In order to address the problem of insufficient and imbalanced samples in hyperspectral image classification, the paper designs an adaptive min log sampling (AMLS) strategy and a loss function that incorporates dice loss and focal loss to assist model training. Experimental results using three public hyperspectral image datasets demonstrate the strong performance of HyperSFormer, particularly in the presence of imbalanced sample data, complex negative samples, and mixed sample classes. HyperSFormer outperforms state-of-the-art methods, including fast patch-free global learning (FPGA), a spectral–spatial-dependent global learning framework (SSDGL), and SegFormer, by at least 2.7% in the mean intersection over union (mIoU). It also improves the overall accuracy and average accuracy values by at least 0.9% and 0.3%, respectively, and the kappa coefficient by at least 0.011. Furthermore, ablation experiments were conducted to determine the optimal hyperparameter and loss function settings for the proposed method, validating the rationality of these settings and the fusion loss function.

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/14/3491/pdf

Reference34 articles.

1. Radar Remote Sensing of Agricultural Canopies: A Review;McNairn;IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.,2017

2. Advanced Spectral Classifiers for Hyperspectral Images: A Review;Ghamisi;IEEE Geosci. Remote Sens. Mag.,2017

3. An Overview of Crop Nitrogen Status Assessment Using Hyperspectral Remote Sensing: Current Status and Perspectives;Fu;Eur. J. Agron.,2021

4. He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27–30). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.

5. A Spectral-Spatial-Dependent Global Learning Framework for Insufficient and Imbalanced Hyperspectral Image Classification;Zhu;IEEE Trans. Cybern.,2021

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Study on the Prediction Model of Litchi Downy Blight Damage Based on IoT and Hyperspectral Data Fusion;IEEE Internet of Things Journal;2024-08-15

2. Deepfake detection using convolutional vision transformers and convolutional neural networks;Neural Computing and Applications;2024-08-08

3. A multi-scale multi-channel CNN introducing a channel-spatial attention mechanism hyperspectral remote sensing image classification method;European Journal of Remote Sensing;2024-05-27

4. Hyperspectral crop image classification via ensemble of classification model with optimal training;Web Intelligence;2024-04-26

5. Cross-and-Diagonal Networks: An Indirect Self-Attention Mechanism for Image Classification;Sensors;2024-03-23