Cross-Attention-Guided Feature Alignment Network for Road Crack Detection
-
Published:2023-09-19
Issue:9
Volume:12
Page:382
-
ISSN:2220-9964
-
Container-title:ISPRS International Journal of Geo-Information
-
Language:en
-
Short-container-title:IJGI
Author:
Xu Chuan1, Zhang Qi1, Mei Liye1, Chang Xiufeng2, Ye Zhaoyi1, Wang Junjian3, Ye Lang3, Yang Wei3
Affiliation:
1. School of Computer Science, Hubei University of Technology, Wuhan 430068, China
2. Unit 92493, Huludao 125000, China
3. School of Information Science and Engineering, Wuchang Shouyi University, Wuhan 430064, China
Abstract
Road crack detection is an important problem in traffic safety and urban planning. Road damage varies widely in type, scale, and depth, which makes the detection task challenging. To address this problem, we propose a Cross-Attention-guided Feature Alignment Network (CAFANet) for extracting and integrating multi-scale features of road damage. Firstly, we use a dual-branch visual encoder with the same structure but different patch sizes (one large patch and one small patch) to extract multi-level damage features. We utilize a Cross-Layer Interaction (CLI) module to establish interaction between the corresponding layers of the two branches, combining their distinct feature extraction capabilities and contextual understanding. Secondly, we employ a Feature Alignment Block (FAB) to align features from different levels or branches both semantically and spatially, which significantly improves CAFANet’s perception of damage regions, reduces background interference, and achieves more precise detection and segmentation of damage. Finally, we adopt multi-layer convolutional segmentation heads to obtain high-resolution feature maps. To validate the effectiveness of our approach, we conduct experiments on the public CRACK500 dataset and compare CAFANet with other mainstream methods. Experimental results demonstrate that CAFANet achieves excellent performance on road crack detection, with clear gains in F1 score and accuracy, reaching an F1 score of 73.22% and an accuracy of 96.78%.
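The abstract only names the building blocks, so the following is a minimal PyTorch sketch of how a dual-branch encoder with cross-branch attention, feature alignment, and a convolutional segmentation head might be wired together. All module names (PatchEmbed, CrossLayerInteraction, FeatureAlignmentBlock), tensor shapes, and hyperparameters (patch sizes 8 and 4, embedding dimension 96, 4 attention heads) are illustrative assumptions, not the authors' published implementation.

```python
# Hypothetical sketch of the CAFANet pipeline described in the abstract.
# All module names and hyperparameters are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class PatchEmbed(nn.Module):
    """Split an image into patches and project them to an embedding dimension."""
    def __init__(self, patch_size, in_ch=3, dim=96):
        super().__init__()
        self.proj = nn.Conv2d(in_ch, dim, kernel_size=patch_size, stride=patch_size)

    def forward(self, x):                       # (B, 3, H, W)
        return self.proj(x)                     # (B, dim, H/p, W/p)


class CrossLayerInteraction(nn.Module):
    """Exchange information between the two branches via cross-attention."""
    def __init__(self, dim=96, heads=4):
        super().__init__()
        self.attn_l2s = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.attn_s2l = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, feat_large, feat_small):
        B, C, Hl, Wl = feat_large.shape
        _, _, Hs, Ws = feat_small.shape
        # Flatten spatial maps to token sequences: (B, N, dim)
        tl = feat_large.flatten(2).transpose(1, 2)
        ts = feat_small.flatten(2).transpose(1, 2)
        # Large-patch tokens query small-patch tokens, and vice versa
        tl = tl + self.attn_l2s(tl, ts, ts, need_weights=False)[0]
        ts = ts + self.attn_s2l(ts, tl, tl, need_weights=False)[0]
        feat_large = tl.transpose(1, 2).reshape(B, C, Hl, Wl)
        feat_small = ts.transpose(1, 2).reshape(B, C, Hs, Ws)
        return feat_large, feat_small


class FeatureAlignmentBlock(nn.Module):
    """Spatially align the coarse branch to the fine branch and fuse them."""
    def __init__(self, dim=96):
        super().__init__()
        self.fuse = nn.Sequential(
            nn.Conv2d(2 * dim, dim, 3, padding=1),
            nn.BatchNorm2d(dim), nn.ReLU(inplace=True))

    def forward(self, feat_large, feat_small):
        # Upsample the coarser (large-patch) map to the finer resolution
        feat_large = F.interpolate(feat_large, size=feat_small.shape[-2:],
                                   mode='bilinear', align_corners=False)
        return self.fuse(torch.cat([feat_large, feat_small], dim=1))


class CAFANetSketch(nn.Module):
    def __init__(self, dim=96, num_classes=2):
        super().__init__()
        self.branch_large = PatchEmbed(patch_size=8, dim=dim)   # coarse branch
        self.branch_small = PatchEmbed(patch_size=4, dim=dim)   # fine branch
        self.cli = CrossLayerInteraction(dim)
        self.fab = FeatureAlignmentBlock(dim)
        # Multi-layer convolutional segmentation head -> per-pixel crack logits
        self.head = nn.Sequential(
            nn.Conv2d(dim, dim, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(dim, num_classes, 1))

    def forward(self, x):
        fl = self.branch_large(x)
        fs = self.branch_small(x)
        fl, fs = self.cli(fl, fs)                # cross-branch interaction
        fused = self.fab(fl, fs)                 # semantic/spatial alignment
        logits = self.head(fused)
        # Restore the input resolution for a dense prediction map
        return F.interpolate(logits, size=x.shape[-2:],
                             mode='bilinear', align_corners=False)


if __name__ == "__main__":
    model = CAFANetSketch()
    out = model(torch.randn(1, 3, 256, 256))
    print(out.shape)                             # torch.Size([1, 2, 256, 256])
```

Running the sketch on a 256×256 image produces a two-channel crack/background logit map at full resolution; the actual CAFANet encoder, CLI, and FAB designs in the paper are more elaborate than this single-stage approximation.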
Funder
Hubei University of Technology; Natural Science Foundation of Hubei Province; University Student Innovation and Entrepreneurship Training Program Project
Subject
Earth and Planetary Sciences (miscellaneous); Computers in Earth Sciences; Geography, Planning and Development
Cited by: 4 articles.