ARE-Net: An Improved Interactive Model for Accurate Building Extraction in High-Resolution Remote Sensing Imagery

Published:2023-09-10 Issue:18 Volume:15 Page:4457
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Weng Qian¹², Wang Qin¹², Lin Yifeng¹², Lin Jiawen¹²

Affiliation:

1. College of Computer and Data Science, Fuzhou University, Fuzhou 350000, China

2. Fujian Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou 350000, China

Abstract

Accurate building extraction from high-resolution remote sensing images is critical for topographic mapping, urban planning, and many other applications. The task is to label each pixel as building or non-building. Although deep-learning-based algorithms have significantly improved the accuracy of building extraction, fully automated methods depend on a large number of annotated samples, which raises annotation costs, limits generalization ability, and leads to misclassification in complex remote sensing images. To address these challenges, this paper proposes ARE-Net, an improved interactive building extraction model based on deep interactive segmentation. The paper makes three key contributions. First, an adaptive-radius encoding (ARE) module is designed to optimize the interaction features of clicks according to the varying shapes and distributions of buildings, providing as much a priori information as possible for building extraction. Second, a two-stage training strategy is proposed to improve the convergence speed and efficiency of the segmentation process. Finally, comprehensive experiments with two models of different sizes (HRNet18s+OCR and HRNet32+OCR) were conducted on the Inria and WHU building datasets. The results show significant improvements over the current state-of-the-art method in terms of NoC90: the proposed method achieved performance enhancements of 7.98% and 13.03% with HRNet18s+OCR and 7.34% and 15.49% with HRNet32+OCR on the WHU and Inria datasets, respectively. The experiments further demonstrated that ARE-Net significantly reduces annotation costs while improving convergence speed and generalization performance.
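
The NoC90 figure reported above is commonly read, in interactive segmentation work, as the average number of clicks needed before a predicted mask reaches 90% IoU. The sketch below shows one way such a number could be computed. It is not the authors' evaluation code. The 20-click cap, the function names, and the toy IoU curves are illustrative assumptions, and the abstract does not state how its percentages were derived, so treating them as relative reductions in NoC90 is also an assumption.

```python
from typing import Sequence


def noc(iou_per_click: Sequence[float],
        threshold: float = 0.90,
        max_clicks: int = 20) -> int:
    """Clicks needed for one image to first reach `threshold` IoU.

    iou_per_click[k] is the IoU after click k+1. If the threshold is never
    reached, the count is capped at `max_clicks` (assumed convention).
    """
    for clicks, iou in enumerate(iou_per_click[:max_clicks], start=1):
        if iou >= threshold:
            return clicks
    return max_clicks


def mean_noc(curves: Sequence[Sequence[float]],
             threshold: float = 0.90,
             max_clicks: int = 20) -> float:
    """Average NoC over a dataset of per-image IoU curves."""
    return sum(noc(c, threshold, max_clicks) for c in curves) / len(curves)


def relative_reduction(baseline: float, ours: float) -> float:
    """Percentage by which `ours` lowers the click count versus `baseline`."""
    return 100.0 * (baseline - ours) / baseline


# Toy IoU curves, invented purely for illustration (not data from the paper).
baseline_curves = [[0.70, 0.85, 0.91], [0.60, 0.80, 0.88, 0.93]]
improved_curves = [[0.80, 0.92], [0.75, 0.89, 0.94]]

baseline_noc90 = mean_noc(baseline_curves)   # images need 3 and 4 clicks
improved_noc90 = mean_noc(improved_curves)   # images need 2 and 3 clicks
print(baseline_noc90, improved_noc90,
      round(relative_reduction(baseline_noc90, improved_noc90), 2))
```

A lower NoC90 means fewer user clicks per building, which is why a reduction in this metric corresponds to lower annotation cost.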

Funder

Natural Science Foundation of Fujian Province

Fuzhou-Xiamen-Quanzhou National Independent Innovation Demonstration Zone Collaborative Innovation Platform

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/18/4457/pdf

Cited by 1 article.

1. ScribbleCDNet: Change detection on high-resolution remote sensing imagery with scribble interaction;International Journal of Applied Earth Observation and Geoinformation;2024-04