MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification-Reference-Cited by-同舟云学术

MCPT: Mixed Convolutional Parallel Transformer for Polarimetric SAR Image Classification

Published:2023-06-05 Issue:11 Volume:15 Page:2936
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Wang Wenke¹^ORCID,Wang Jianlong¹^ORCID,Lu Bibo¹,Liu Boyuan¹,Zhang Yake²,Wang Chunyang¹^ORCID

Affiliation:

1. School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454003, China

2. School of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China

Abstract

Vision transformers (ViT) have the characteristics of massive training data and complex model, which cannot be directly applied to polarimetric synthetic aperture radar (PolSAR) image classification tasks. Therefore, a mixed convolutional parallel transformer (MCPT) model based on ViT is proposed for fast PolSAR image classification. First of all, a mixed depthwise convolution tokenization is introduced. It replaces the learnable linear projection in the original ViT to obtain patch embeddings. The process of tokenization can reduce computational and parameter complexity and extract features of different receptive fields as input to the encoder. Furthermore, combining the idea of shallow networks with lower latency and easier optimization, a parallel encoder is implemented by pairing the same modules and recombining to form parallel blocks, which can decrease the network depth and computing power requirement. In addition, the original class embedding and position embedding are removed during tokenization, and a global average pooling layer is added after the encoder for category feature extraction. Finally, the experimental results on AIRSAR Flevoland and RADARSAT-2 San Francisco datasets show that the proposed method achieves a significant improvement in training and prediction speed. Meanwhile, the overall accuracy achieved was 97.9% and 96.77%, respectively.

Funder

National Natural Science Foundation of China

Doctoral Foundation of Henan Polytechnic University

Henan Provincial Science and Technology Research Project

Key Research Project Fund of Institution of Higher Education in Henan Province

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/11/2936/pdf

Reference68 articles.

1. An introduction to synthetic aperture radar (SAR);Chan;Prog. Electromagn. Res. B,2008

2. Principles of Synthetic Aperture Radar;Bamler;Surv. Geophys.,2000

3. Pasmurov, A., and Zinoviev, J. (2005). Radar Imaging and Holography, IET Digital Library.

4. Signal-to-Clutter Ratio Enhancement in Bistatic Very High Frequency (VHF)-Band SAR Images of Truck Vehicles in Forested and Urban Terrain;Ulander;IET Radar Sonar Navig.,2010

5. Spectral Clustering Ensemble Applied to SAR Image Segmentation;Zhang;IEEE Trans. Geosci. Remote Sens.,2008

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey of the vision transformers and their CNN-transformer based variants;Artificial Intelligence Review;2023-10-04