A Spatial–Spectral Transformer for Hyperspectral Image Classification Based on Global Dependencies of Multi-Scale Features

Author:

Ma Yunxuan1,Lan Yan1,Xie Yakun2,Yu Lanxin3,Chen Chen1,Wu Yusong1,Dai Xiaoai1

Affiliation:

1. College of Earth Sciences, Chengdu University of Technology, Chengdu 610059, China

2. Faculty of Geosciences and Environmental Engineering, Southwest Jiaotong University, Chengdu 610097, China

3. School of Statistics, East China Normal University, Shanghai 200062, China

Abstract

Vision transformers (ViTs) are increasingly utilized for HSI classification due to their outstanding performance. However, ViTs encounter challenges in capturing global dependencies among objects of varying sizes, and fail to effectively exploit the spatial–spectral information inherent in HSI. In response to this limitation, we propose a novel solution: the multi-scale spatial–spectral transformer (MSST). Within the MSST framework, we introduce a spatial–spectral token generator (SSTG) and a token fusion self-attention (TFSA) module. Serving as the feature extractor for the MSST, the SSTG incorporates a dual-branch multi-dimensional convolutional structure, enabling the extraction of semantic characteristics that encompass spatial–spectral information from HSI and subsequently tokenizing them. TFSA is a multi-head attention module with the ability to encode attention to features across various scales. We integrated TFSA with cross-covariance attention (CCA) to construct the transformer encoder (TE) for the MSST. Utilizing this TE to perform attention modeling on tokens derived from the SSTG, the network effectively simulates global dependencies among multi-scale features in the data, concurrently making optimal use of spatial–spectral information in HSI. Finally, the output of the TE is fed into a linear mapping layer to obtain the classification results. Experiments conducted on three popular public datasets demonstrate that the MSST method achieved higher classification accuracy compared to state-of-the-art (SOTA) methods.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Postdoctoral Innovation Talents Support Program

China Postdoctoral Science Foundation

Natural Science Foundation of Sichuan Province

Publisher

MDPI AG

Reference57 articles.

1. Srivastava, P.K., Malhi, R.K.M., Pandey, P.C., Anand, A., Singh, P., Pandey, M.K., and Gupta, A. (2020). Hyperspectral Remote Sensing, Elsevier.

2. Hyperspectral image analysis. A tutorial;Amigo;Anal. Chim. Acta,2015

3. Hyperspectral remote sensing in lithological mapping, mineral exploration, and environmental geology: An updated review;Sima;J. Appl. Remote Sens.,2021

4. Machine learning techniques for analysis of hyperspectral images to determine quality of food products: A review;Saha;Curr. Res. Food Sci.,2021

5. Application of hyperspectral imaging systems and artificial intelligence for quality assessment of fruit, vegetables and mushrooms: A review;Wieme;Biosyst. Eng.,2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3