HA-RoadFormer: Hybrid Attention Transformer with Multi-Branch for Large-Scale High-Resolution Dense Road Segmentation-Reference-Cited by-同舟云学术

HA-RoadFormer: Hybrid Attention Transformer with Multi-Branch for Large-Scale High-Resolution Dense Road Segmentation

Published:2022-06-02 Issue:11 Volume:10 Page:1915
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zhang Zheng,Miao Chunle^ORCID,Liu Changan,Tian Qing,Zhou Yongsheng^ORCID

Abstract

Road segmentation is one of the essential tasks in remote sensing. Large-scale high-resolution remote sensing images originally have larger pixel sizes than natural images, while the existing models based on Transformer have the high computational cost of square complexity, leading to more extended model training and inference time. Inspired by the long text Transformer model, this paper proposes a novel hybrid attention mechanism to improve the inference speed of the model. By calculating several diagonals and random blocks of the attention matrix, hybrid attention achieves linear time complexity in the token sequence. Using the superposition of adjacent and random attention, hybrid attention introduces the inductive bias similar to convolutional neural networks (CNNs) and retains the ability to acquire long-distance dependence. In addition, the dense road segmentation result of remote sensing image still has the problem of insufficient continuity. However, multiscale feature representation is an effective means in the network based on CNNs. Inspired by this, we propose a multi-scale patch embedding module, which divides images by patches with different scales to obtain coarse-to-fine feature representations. Experiments on the Massachusetts dataset show that the proposed HA-RoadFormer could effectively preserve the integrity of the road segmentation results, achieving a higher Intersection over Union (IoU) 67.36% of road segmentation compared to other state-of-the-art (SOTA) methods. At the same time, the inference speed has also been greatly improved compared with other Transformer based models.

Funder

North China University of Technology Research Start-up Funds

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/11/1915/pdf

Reference51 articles.

1. A New Approach to Urban Road Extraction Using High-Resolution Aerial Image

2. An Integrated Method for Urban Main-Road Centerline Extraction From Optical Remotely Sensed Imagery

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PCCAU-Net: A Novel Road Extraction Method Based on Coord Convolution and a DCA Module;Applied Sciences;2024-02-18

2. DELFormer: detail-enhanced lightweight transformer for road segmentation;Journal of Applied Remote Sensing;2023-11-20

3. GLFFNet: A Global and Local Features Fusion Network with Biencoder for Remote Sensing Image Segmentation;Applied Sciences;2023-07-28

4. Low-parameter method for delineation of agricultural fields in satellite images based on multi-temporal MSAVI2 data;COMPUT OPT;2023

5. ACTNet: A Dual-Attention Adapter with a CNN-Transformer Network for the Semantic Segmentation of Remote Sensing Imagery;Remote Sensing;2023-04-29