RoadFormer: Road Extraction Using a Swin Transformer Combined with a Spatial and Channel Separable Convolution-Reference-Cited by-同舟云学术

RoadFormer: Road Extraction Using a Swin Transformer Combined with a Spatial and Channel Separable Convolution

Published:2023-02-15 Issue:4 Volume:15 Page:1049
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Liu Xiangzeng¹^ORCID,Wang Ziyao¹,Wan Jinting¹,Zhang Juli²,Xi Yue³,Liu Ruyi¹,Miao Qiguang¹^ORCID

Affiliation:

1. School of Computer Science and Technology, Xidian University, Xi’an 710071, China

2. Academy of Advanced Interdisciplinary Research, Xidian University, Xi’an 710071, China

3. Guangzhou Institute of Technology, Xidian University, Guangzhou 510555, China

Abstract

The accurate detection and extraction of roads using remote sensing technology are crucial to the development of the transportation industry and intelligent perception tasks. Recently, in view of the advantages of CNNs in feature extraction, its related road extraction methods have been proposed successively. However, due to the limitation of kernel size, they perform less effectively at capturing long-range information and global context, which are crucial for road targets distributed over long distances and highly structured. To deal with this problem, a novel model named RoadFormer with a Swin Transformer as the backbone is developed in this paper. Firstly, to extract long-range information effectively, a Swin Transformer multi-scale encoder is adopted in our model. Secondly, to enhance the feature representation capability of the model, we design an innovative bottleneck module, in which the spatial and channel separable convolution is employed to obtain fine-grained and globe features, and then a dilated block is connected after the spatial convolution module to capture more integrated road structures. Finally, a lightweight decoder consisting of transposed convolution and skip connection generates the final extraction results. Extensive experimental results confirm the advantages of RoadFormer on the Deepglobe and Massachusetts datasets. The comparative results of visualization and quantification demonstrate that our model outperforms comparable methods.

Funder

the National Key Research and Development Program of China

The Key R&D Projects of Qingdao Science and Technology Plan

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/4/1049/pdf

Reference44 articles.

1. Simultaneous Road Surface and Centerline Extraction from Large-Scale Remote Sensing Images Using CNN-Based Segmentation and Tracing;Wei;IEEE Trans. Geosci. Remote Sens.,2020

2. A Fusion Network for Road Detection via Spatial Propagation and Spatial Transformation;Yang;Pattern Recognit.,2020