Building detection using a dense attention network from LiDAR and image data

Published:2021-12-01 Issue:4 Volume:75 Page:209-236
ISSN:1195-1036
Container-title:Geomatica
language:fr
Short-container-title:Geomatica

Author:

Ghasemian Nafiseh¹, Wang Jinfei¹,², Reza Najafi Mohammad³

Affiliation:

1. Department of Geography and Environment, University of Western Ontario, London, ON, Canada.

2. Institute of Earth and Space Exploration, University of Western Ontario, London, ON, Canada.

3. Department of Civil and Environmental Engineering, University of Western Ontario, London, ON, Canada.

Abstract

Accurate building mapping using remote sensing data is challenging because of the complexity of building structures, particularly in populated cities. LiDAR data are widely used for building extraction because they provide height information, which can help distinguish buildings from other tall objects. However, tall trees and bridges in the vicinity of buildings can limit the application of LiDAR data, particularly in urban areas. Combining LiDAR and orthoimages can help in such situations, because orthoimages can provide information on the physical properties of objects, such as reflectance characteristics. One efficient way to combine these two data sources is to use convolutional neural networks (CNNs). This study proposes a CNN architecture based on dense attention blocks for building detection in southern Toronto and Massachusetts. The stacking of information from multiple previous layers was inspired by dense attention networks (DANs). DAN blocks consist of batch normalization, convolution, dropout, and average pooling layers to extract high- and low-level features. Compared with two other widely used deep learning techniques, VGG16 and ResNet50, the proposed method has a simpler architecture and converges faster with higher accuracy. In addition, a comparison with two other state-of-the-art deep learning methods, U-Net and ResUNet, showed that the proposed technique achieved a higher F1-score of 0.71, compared with 0.42 for U-Net and 0.49 for ResUNet.
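
For readers unfamiliar with the metric quoted above, the F1-score is the harmonic mean of precision and recall, which can be written as F1 = 2TP / (2TP + FP + FN). The sketch below is only an illustration and is not taken from the paper: it assumes a pixel-wise comparison of binary masks (1 = building, 0 = background), which the abstract does not specify, and the function name `f1_score` and the toy masks are hypothetical.

```python
def f1_score(predicted, reference):
    """Pixel-wise F1 for two equally sized binary masks (1 = building).

    Assumption: masks are lists of rows of 0/1 values; the paper's actual
    evaluation protocol is not described in the abstract.
    """
    if len(predicted) != len(reference):
        raise ValueError("masks must have the same number of rows")
    tp = fp = fn = 0
    for pred_row, ref_row in zip(predicted, reference):
        if len(pred_row) != len(ref_row):
            raise ValueError("mask rows must have the same length")
        for p, r in zip(pred_row, ref_row):
            if p and r:
                tp += 1      # building pixel correctly detected
            elif p and not r:
                fp += 1      # background pixel labelled as building
            elif r and not p:
                fn += 1      # building pixel that was missed
    denominator = 2 * tp + fp + fn
    # With no building pixels in either mask, report 0.0 by convention.
    return 2 * tp / denominator if denominator else 0.0


# Toy masks (hypothetical data): 2 true positives, 1 false positive,
# 1 false negative.
predicted_mask = [[1, 1, 0],
                  [0, 1, 0]]
reference_mask = [[1, 0, 0],
                  [0, 1, 1]]
score = f1_score(predicted_mask, reference_mask)
```

Because F1 ignores true negatives, it is less inflated than overall accuracy when background pixels dominate a scene, which is typically the case in building extraction.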

Publisher

Canadian Science Publishing

Subject

Earth-Surface Processes; Geography, Planning and Development

Link

https://cdnsciencepub.com/doi/pdf/10.1139/geomat-2021-0013

Reference: 21 articles.

1. ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data

2. Hyperspectral multiple-change detection framework based on sparse representation and support vector data description algorithms

3. Hamaguchi, R., and Hikosaka, S. 2018. Building detection from satellite imagery using ensemble of size-specific detectors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. pp. 187–191. doi:10.1109/CVPRW.2018.00041.

4. Effect of patch size and network architecture on a convolutional neural network approach for automatic segmentation of OCT retinal layers

5. He, K., Zhang, X., Ren, S., and Sun, J. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 770–778. doi:10.1109/CVPR.2016.90.

Cited by 2 articles.

1. Segment Anything Model-Based Building Footprint Extraction for Residential Complex Spatial Assessment Using LiDAR Data and Very High-Resolution Imagery; Remote Sensing; 2024-07-20

2. Flood or Non-Flooded: A Comparative Study of State-of-the-Art Models for Flood Image Classification Using the FloodNet Dataset with Uncertainty Offset Analysis; Water; 2023-02-24