Affiliation:
1. College of Mathematics and Computer Science, Zhejiang A & F University, Hangzhou 311300, China
2. Key Laboratory of State Forestry and Grassland Administration on Forestry Sensing Technology and Intelligent Equipment, Hangzhou 311300, China
3. Key Laboratory of Forestry Intelligent Monitoring and Information Technology of Zhejiang Province, Hangzhou 311300, China
Abstract
Under China's rural revitalization strategy, different types of rural settlement agglomerations have formed and become spatially intermixed. Discriminating them in remote sensing images is of great significance for rural land planning and the improvement of living environments. However, automatic methods for obtaining information on rural settlement differentiation are currently lacking. In this paper, an improved encoder–decoder network, ASCEND-UNet, was designed based on the original UNet and used to segment and classify dispersed and clustered rural settlement buildings in high-resolution satellite images. The ASCEND-UNet model incorporates three components: first, an atrous spatial pyramid pooling (ASPP) multi-scale feature fusion module was added to the encoder; second, a spatial and channel squeeze and excitation (scSE) block was embedded at the skip connection; third, a hybrid dilated convolution (HDC) block was used in the decoder. In the proposed framework, ASPP and HDC serve as multiple dilated convolution blocks that expand the receptive field through a series of convolutions with varying dilation rates. The scSE block is an attention mechanism that focuses on features in both the spatial and channel dimensions. Model comparisons and accuracy assessments against the original UNet, PSPNet, DeepLabV3+, and SegNet verified the effectiveness of the proposed model. Compared with the original UNet, ASCEND-UNet achieved improvements of 4.67%, 2.80%, 3.73%, and 6.28% in precision, recall, F1-score, and MIoU, respectively. The contributions of the HDC, ASPP, and scSE modules were examined in ablation experiments. By integrating multiple dilated convolution blocks with an attention mechanism, the proposed model obtained more accurate and stable results. This model adds to the automatic methods available for semantic segmentation of different rural settlement types in remote sensing images.
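The receptive-field expansion that the abstract attributes to dilated convolutions can be illustrated with a short calculation. The sketch below is not taken from the paper: it assumes stride-1 convolutions with a 3 × 3 kernel, and the dilation rates (1, 2, 5) are hypothetical values chosen only for illustration, since the abstract does not state the rates used in ASCEND-UNet. The function names are likewise illustrative.

```python
from typing import Sequence


def effective_kernel(kernel_size: int, dilation: int) -> int:
    """Span (in pixels) covered by one dilated kernel along one axis."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def stacked_receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Receptive field of stride-1 convolutions applied one after another.

    Each layer widens the receptive field by (kernel_size - 1) * dilation.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


if __name__ == "__main__":
    # Three ordinary 3x3 convolutions versus three dilated ones
    # (the rates 1, 2, 5 are an assumption, not the paper's setting).
    print(stacked_receptive_field(3, [1, 1, 1]))
    print(stacked_receptive_field(3, [1, 2, 5]))
```

With the same number of layers and parameters, the dilated stack covers a considerably wider input window than the undilated one, which is the property that ASPP and HDC rely on.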
Funder
National Natural Science Foundation of China
Natural Science Foundation of Zhejiang Province