Affiliation:
1. School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
2. Department of Computer Technology and Application, Qinghai University, Xi’ning 810016, China
Abstract
Semantic labeling of very high-resolution remote sensing images (VHRRSI) has emerged as a crucial research area in remote sensing image interpretation. However, target orientation and scale vary widely, and small targets in particular are prone to occlusion and misidentification. High interclass similarity and low intraclass similarity make it even harder to distinguish objects with similar color and geographic location. To address these challenges, we introduce a self-cascading multiscale network (ScasMNet), based on a fully convolutional network, that aims to improve the segmentation precision of each category in remote sensing images (RSIs). In ScasMNet, cropped Digital Surface Model (DSM) data and the corresponding RGB data are fed into the network through two distinct paths. In the encoder stage, one branch uses convolution to extract height information from the DSM images layer by layer, which helps separate trees from low vegetation with similar color and geographic location. A parallel branch extracts spatial, color, and texture information from the RGB data. By cascading the features of different layers, the heterogeneous data are fused to generate complementary discriminative features. Lastly, to refine the segmented edges, fully connected conditional random fields (DenseCRFs) are used to postprocess the presegmented images. Experiments show that ScasMNet achieves an overall accuracy (OA) of 92.74% on two challenging benchmarks, with particularly strong performance on small-scale objects. These results place ScasMNet among the state-of-the-art methods for semantic segmentation of RSIs.
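For readers unfamiliar with the metric, overall accuracy (OA) is the fraction of pixels whose predicted label matches the reference label. The following is a minimal sketch of that standard definition, not the authors' evaluation code. The function names, the label encoding, and the toy label maps are assumptions made for illustration (the "car" class in particular is hypothetical). It also computes per-class recall, since a high OA can coexist with poor results on a small class.

```python
from collections import Counter
from typing import Dict, Sequence


def overall_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of pixels whose predicted label equals the reference label."""
    if len(y_true) != len(y_pred):
        raise ValueError("label sequences must have the same length")
    if not y_true:
        raise ValueError("at least one pixel is required")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def per_class_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[int, float]:
    """For each reference class, the share of its pixels labeled correctly."""
    total = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: hits[c] / n for c, n in total.items()}


# Hypothetical flattened label maps (illustrative encoding, not from the paper):
# 0 = tree, 1 = low vegetation, 2 = car (a small-scale class)
reference = [0, 0, 0, 0, 1, 1, 1, 1, 1, 2]
predicted = [0, 0, 0, 1, 1, 1, 1, 1, 1, 1]

oa = overall_accuracy(reference, predicted)
recall = per_class_recall(reference, predicted)
print(f"OA = {oa:.2%}")
print(recall)
```

In this toy case the single small-class pixel is missed entirely while OA remains high, which is why per-class scores are usually read alongside OA when small objects matter.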
Funder
National Natural Science Foundation of China
Shaanxi Provincial Key R&D Program