Affiliation:
1. School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
2. School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
3. Ecometeorology Joint Laboratory of Zhengzhou University and Chinese Academy of Meteorological Science, Zhengzhou 450001, China
Abstract
Precise building extraction from high-resolution remote sensing images has important applications in urban planning, resource management, and environmental conservation. In recent years, deep neural networks (DNNs) have attracted substantial attention for their ability to learn and extract features, and they have become integral to building extraction methods, with strong reported performance. Nonetheless, prevailing DNN-based models for building extraction often overlook spatial information during the feature extraction phase. In addition, many existing models fuse features in a simple, direct manner, which can lead to spurious target detections and amplify internal noise. To address these concerns, we present a multi-scale attention network (MSANet) tailored for building extraction from high-resolution remote sensing images. In our approach, we first extract multi-scale building features using a multi-scale channel attention mechanism and a multi-scale spatial attention mechanism. We then apply adaptive hierarchical weighting to the extracted building features and introduce a gating mechanism to fuse the multi-scale features effectively. The proposed MSANet was evaluated on the WHU aerial imagery dataset and WHU satellite dataset II. It achieved F1 scores of 93.76% and 77.64% and intersection over union (IoU) values of 88.25% and 63.46% on the two datasets, respectively, exceeding the results of DeepLabV3 and GSMC.
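For readers unfamiliar with the two reported metrics, the sketch below shows how F1 and IoU are commonly computed for a binary building mask. It assumes pixel-wise counting over a single predicted mask and its ground truth; the function name `binary_mask_scores`, the toy masks, and the handling of empty masks are illustrative assumptions, not the authors' evaluation code.

```python
def binary_mask_scores(pred, truth):
    """Return (f1, iou) for two equally sized binary masks (nested lists of 0/1).

    Hypothetical helper for illustration only.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # building pixel correctly predicted
            elif p and not t:
                fp += 1          # background predicted as building
            elif t and not p:
                fn += 1          # building pixel missed
    union = tp + fp + fn
    if union == 0:
        # Assumption: two empty masks count as perfect agreement.
        return 1.0, 1.0
    iou = tp / union
    f1 = 2 * tp / (2 * tp + fp + fn)
    return f1, iou


# Toy 2x3 masks: 1 = building, 0 = background.
pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]

f1, iou = binary_mask_scores(pred, truth)
print(f"F1 = {f1:.4f}, IoU = {iou:.4f}")
```

When both metrics are derived from the same pixel counts, they are tied by F1 = 2·IoU / (1 + IoU), so F1 is never lower than IoU.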
Funder
Second Tibetan Plateau Scientific Expedition and Research (STEP) program
Key Technologies Research on Development and Service of Yellow River Simulator for Super-computing Platform