Affiliation:
1. State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610059, China
Abstract
Semantic segmentation of remote sensing images has been widely used in environmental protection, geological disaster discovery, and natural resource assessment. With the rapid development of deep learning, convolutional neural networks (CNNs) have dominated semantic segmentation, relying on their powerful local information extraction capabilities. Due to the locality of convolution operation, it can be challenging to obtain global context information directly. However, Transformer has excellent potential in global information modeling. This paper proposes a new hybrid convolutional and Transformer semantic segmentation model called CTFuse, which uses a multi-scale convolutional attention module in the convolutional part. CTFuse is a serial structure composed of a CNN and a Transformer. It first uses convolution to extract small-size target information and then uses Transformer to embed large-size ground target information. Subsequently, we propose a spatial and channel attention module in convolution to enhance the representation ability for global information and local features. In addition, we also propose a spatial and channel attention module in Transformer to improve the ability to capture detailed information. Finally, compared to other models used in the experiments, our CTFuse achieves state-of-the-art results on the International Society of Photogrammetry and Remote Sensing (ISPRS) Vaihingen and ISPRS Potsdam datasets.
Subject
General Earth and Planetary Sciences
Reference59 articles.
1. Improved maize cultivated area estimation over a large scale combining modis–evi time series data and crop phenological information;Zhang;ISPRS J. Photogramm. Remote Sens.,2014
2. Scale sequence joint deep learning (ss-jdl) for land use and land cover classification;Zhang;Remote Sens. Environ.,2020
3. Using aerial imagery and gis in automated building footprint extraction and shape recognition for earthquake risk assessment of urban inventories;Sahar;IEEE Trans. Geosci. Remote Sens.,2010
4. Joint deep learning for land cover and land use classification;Zhang;Remote Sens. Environ.,2019
5. Fu, Y., Zhao, C., Wang, J., Jia, X., Yang, G., Song, X., and Feng, H. (2017). An improved combination of spectral and spatial features for vegetation classification in hyperspectral images. Remote Sens., 9.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献