Affiliation:
1. School of Computer Science and Engineering, Xi’an Technological University, Xi’an 710021, China
2. School of Statistics, Lanzhou University of Finance and Economics, Lanzhou 730101, China
Abstract
Semantic segmentation of remote sensing images poses a formidable challenge within this domain. Our investigation commences with a pilot study aimed at scrutinizing the advantages and disadvantages of employing a Transformer architecture and a CNN architecture in remote sensing imagery (RSI). Our objective is to substantiate the indispensability of both local and global information for RSI analysis. In this research article, we harness the potential of the Transformer model to establish global contextual understanding while incorporating an additional convolution module for localized perception. Nonetheless, a direct fusion of these heterogeneous information sources often yields subpar outcomes. To address this limitation, we propose an innovative hierarchical fusion feature information module that this model can fuse Transformer and CNN features using an ensemble-to-set approach, thereby enhancing information compatibility. Our proposed model, named FURSformer, amalgamates the strengths of the Transformer architecture and CNN. The experimental results clearly demonstrate the effectiveness of this approach. Notably, our model achieved an outstanding accuracy of 90.78% mAccuracy on the DLRSD dataset.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference42 articles.
1. Seed region growing based on gradient vector flow for medical image segmentation;Liu;Comput. Med. Imaging Graph.,2008
2. Effective Image Segmentation using Fuzzy C Means Clustering with Morphological Processing;Oka;Int. J. Image Graph. Signal Process.,2012
3. Land use changes modelling using advanced methods: Cellular automata and artificial neural networks. The spatial and explicit representation of land cover dynamics at the cross-border region scale;Basse;Appl. Geogr.,2014
4. Imagenet classification with deep convolutional neural networks;Krizhevsky;Commun. ACM,2017
5. Deng, J., Wei, D., Socher, R., Li, L.-J., Li, K., and Li, F.-F. (2009, January 20–25). ImageNet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, FL, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献