Affiliation:
1. School of Measurement and Communication Engineering, Harbin University of Science and Technology, Harbin 150080, China
2. College of Electrical and Information Engineering, Heilongjiang Institute of Technology, Harbin 150001, China
3. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China
Abstract
Hyperspectral imaging captures images of objects across a wide spectral range, and the additional spectral information it provides reveals subtle variations in the objects and their composition. Convolutional neural networks (CNNs) have shown remarkable feature extraction capabilities for hyperspectral image (HSI) classification, but their ability to capture deep semantic features is limited. Transformer models based on attention mechanisms, on the other hand, excel at handling sequential data and have demonstrated great potential in various applications. Motivated by these two facts, this paper proposes a multiscale spectral–spatial transposed transformer (MSSTT) that captures the high-level semantic features of an HSI while preserving as much of the spectral information as possible. The MSSTT consists of a spectral–spatial Inception module, which extracts spectral and spatial features using multiscale convolutional kernels, and a spatial transpose Inception module, which further enhances and extracts spatial information. A transformer model with a cosine attention mechanism is also included to extract deep semantic features, with the QKV matrix constrained so that the output remains within the activation range. Finally, the classification results are obtained by applying a linear layer to the learnable tokens. Experimental results on three public datasets show that the proposed MSSTT outperforms other deep learning methods in HSI classification. On the Indian Pines, Pavia University, and Salinas datasets, it achieved accuracies of 97.19%, 99.47%, and 99.90%, respectively, with 5% of the samples used for training.
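The abstract does not give the exact form of the cosine attention, so the following is only a minimal sketch of one common formulation, not the authors' implementation. It assumes that queries and keys are L2-normalized before the dot product, which bounds every similarity score to [-1, 1], and that a temperature `tau` scales the scores before the softmax. The function name, `tau`, `eps`, and the random projection matrices are illustrative assumptions.

```python
import numpy as np

def cosine_attention(q, k, v, tau=0.1, eps=1e-8):
    """Single-head cosine attention (illustrative sketch).

    q, k: arrays of shape (n_tokens, d); v: array of shape (n_tokens, d_v).
    tau is an assumed temperature; eps avoids division by zero.
    """
    # L2-normalize queries and keys so their dot product is a cosine similarity.
    q_unit = q / (np.linalg.norm(q, axis=-1, keepdims=True) + eps)
    k_unit = k / (np.linalg.norm(k, axis=-1, keepdims=True) + eps)
    sim = q_unit @ k_unit.T  # every entry lies in [-1, 1]

    # Temperature-scaled, numerically stable softmax over the keys.
    logits = sim / tau
    logits = logits - logits.max(axis=-1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=-1, keepdims=True)

    # Each output row is a convex combination of the value rows.
    return weights @ v, sim

# Toy input: 5 tokens with 8 features each (random stand-ins for HSI features).
rng = np.random.default_rng(0)
tokens = rng.normal(size=(5, 8))
w_q, w_k, w_v = (rng.normal(size=(8, 8)) for _ in range(3))
q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
out, sim = cosine_attention(q, k, v)
```

Because the scores depend only on the directions of the queries and keys, rescaling either one leaves the attention weights essentially unchanged, and each output stays within the range spanned by the values. This is one way to read the abstract's statement that the output is kept within the activation range.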
Funder
Natural Science Foundation of Heilongjiang Province for Key projects, China
Postdoctoral Scientific Research Developmental Fund of Heilongjiang Province, China
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering
Cited by
1 article.