A Spatial–Spectral Transformer for Hyperspectral Image Classification Based on Global Dependencies of Multi-Scale Features-Reference-Cited by-同舟云学术

A Spatial–Spectral Transformer for Hyperspectral Image Classification Based on Global Dependencies of Multi-Scale Features

Published:2024-01-20 Issue:2 Volume:16 Page:404
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Ma Yunxuan¹,Lan Yan¹,Xie Yakun²,Yu Lanxin³,Chen Chen¹,Wu Yusong¹,Dai Xiaoai¹

Affiliation:

1. College of Earth Sciences, Chengdu University of Technology, Chengdu 610059, China

2. Faculty of Geosciences and Environmental Engineering, Southwest Jiaotong University, Chengdu 610097, China

3. School of Statistics, East China Normal University, Shanghai 200062, China

Abstract

Vision transformers (ViTs) are increasingly utilized for HSI classification due to their outstanding performance. However, ViTs encounter challenges in capturing global dependencies among objects of varying sizes, and fail to effectively exploit the spatial–spectral information inherent in HSI. In response to this limitation, we propose a novel solution: the multi-scale spatial–spectral transformer (MSST). Within the MSST framework, we introduce a spatial–spectral token generator (SSTG) and a token fusion self-attention (TFSA) module. Serving as the feature extractor for the MSST, the SSTG incorporates a dual-branch multi-dimensional convolutional structure, enabling the extraction of semantic characteristics that encompass spatial–spectral information from HSI and subsequently tokenizing them. TFSA is a multi-head attention module with the ability to encode attention to features across various scales. We integrated TFSA with cross-covariance attention (CCA) to construct the transformer encoder (TE) for the MSST. Utilizing this TE to perform attention modeling on tokens derived from the SSTG, the network effectively simulates global dependencies among multi-scale features in the data, concurrently making optimal use of spatial–spectral information in HSI. Finally, the output of the TE is fed into a linear mapping layer to obtain the classification results. Experiments conducted on three popular public datasets demonstrate that the MSST method achieved higher classification accuracy compared to state-of-the-art (SOTA) methods.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Postdoctoral Innovation Talents Support Program

China Postdoctoral Science Foundation

Natural Science Foundation of Sichuan Province

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/2/404/pdf

Reference57 articles.

1. Srivastava, P.K., Malhi, R.K.M., Pandey, P.C., Anand, A., Singh, P., Pandey, M.K., and Gupta, A. (2020). Hyperspectral Remote Sensing, Elsevier.

2. Hyperspectral image analysis. A tutorial;Amigo;Anal. Chim. Acta,2015

3. Hyperspectral remote sensing in lithological mapping, mineral exploration, and environmental geology: An updated review;Sima;J. Appl. Remote Sens.,2021

4. Machine learning techniques for analysis of hyperspectral images to determine quality of food products: A review;Saha;Curr. Res. Food Sci.,2021

5. Application of hyperspectral imaging systems and artificial intelligence for quality assessment of fruit, vegetables and mushrooms: A review;Wieme;Biosyst. Eng.,2022

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Peak-Finding Siamese Convolutional Neural Network (PF-SCNN) for Aero-Engine Hot Jet FT-IR Spectrum Classification;Aerospace;2024-08-28

2. EAS$$^2$$KAM: enhanced adaptive source-selection kernel with attention mechanism for hyperspectral image classification;Earth Science Informatics;2024-08-27

3. Transformer-enhanced two-stream complementary convolutional neural network for hyperspectral image classification;Journal of the Franklin Institute;2024-08

4. Spectral-Spatial Center-Aware Bottleneck Transformer for Hyperspectral Image Classification;Remote Sensing;2024-06-13

5. A Rapid Detection Method for Coal Ash Content in Tailings Suspension Based on Absorption Spectra and Deep Feature Extraction;Mathematics;2024-05-29