Crack45K: Integration of Vision Transformer with Tubularity Flow Field (TuFF) and Sliding-Window Approach for Crack-Segmentation in Pavement Structures

Published:2022-12-26 Issue:1 Volume:13 Page:55
ISSN:2075-5309
Container-title:Buildings
language:en
Short-container-title:Buildings

Author:

Ali Luqman, Jassmi Hamad Al, Khan Wasif, Alnajjar Fady

Abstract

Recently, deep-learning (DL)-based crack-detection systems have proven to be the method of choice for image-processing-based inspection. However, human-like generalization remains challenging, owing to a wide variety of factors such as crack type and size. Additionally, because of their localized receptive fields, CNNs have a high false-detection rate and perform poorly when attempting to capture the relevant areas of an image. This study proposes a vision-transformer-based crack-detection framework that treats image data as a succession of small patches and retrieves global contextual information (GCI) through self-attention (SA), thereby addressing the CNNs' inductive biases, including locally constrained receptive fields and translation invariance. The vision-transformer (ViT) classifier was combined with a sliding-window approach and the tubularity-flow-field (TuFF) algorithm to improve crack classification, localization, and segmentation. First, the ViT framework was trained on a custom dataset of 45K images at 224 × 224 pixel resolution, and achieved accuracy, precision, recall, and F1 scores of 0.960, 0.971, 0.950, and 0.960, respectively. Second, the trained ViT was integrated with the sliding-window (SW) approach to obtain a crack-localization map from large images. In the last step, the SW-based ViT classifier was merged with the TuFF algorithm to obtain efficient crack mapping by suppressing unwanted regions. The robustness and adaptability of the proposed integrated architecture were tested on new data acquired under different conditions and not used during the training or validation of the model. The performance of the proposed ViT architecture was evaluated and compared with that of various state-of-the-art (SOTA) deep-learning approaches. The experimental results show that ViT equipped with a sliding window and the TuFF algorithm can enhance real-world crack classification, localization, and segmentation performance.
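
The sliding-window step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the stride, the threshold, and the placeholder `classify_patch` (a dark-pixel heuristic standing in for the trained ViT) are all assumptions made for illustration. Only the 224 × 224 patch size comes from the abstract.

```python
import numpy as np

PATCH = 224  # patch size from the abstract (224 x 224 training images)


def classify_patch(patch: np.ndarray) -> float:
    """Hypothetical stand-in for the trained ViT classifier.

    Returns the fraction of dark pixels as a crude "crack score" in [0, 1].
    A real system would run the patch through the trained model instead.
    """
    return float((patch < 0.3).mean())


def sliding_window_map(image: np.ndarray, patch: int = PATCH,
                       stride: int = PATCH, threshold: float = 0.02) -> np.ndarray:
    """Slide a window over a grayscale image and flag patches scored as crack.

    Returns a boolean grid with one cell per window position.
    """
    h, w = image.shape[:2]
    if h < patch or w < patch:
        raise ValueError("image is smaller than one patch")
    rows = (h - patch) // stride + 1
    cols = (w - patch) // stride + 1
    grid = np.zeros((rows, cols), dtype=bool)
    for r in range(rows):
        for c in range(cols):
            y, x = r * stride, c * stride
            window = image[y:y + patch, x:x + patch]
            grid[r, c] = classify_patch(window) > threshold
    return grid


# Synthetic example: a bright 448 x 672 "pavement" image (2 x 3 patches)
# with a dark horizontal line crossing the top row of patches.
image = np.ones((448, 672))
image[100:110, :] = 0.0
crack_map = sliding_window_map(image)
```

In this toy case only the top row of the grid is flagged. A segmentation stage (TuFF in the paper) would then be applied to the flagged regions; that stage is not sketched here.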

Publisher

MDPI AG

Subject

Building and Construction,Civil and Structural Engineering,Architecture

Link

https://www.mdpi.com/2075-5309/13/1/55/pdf

References: 68 articles.

1. CrackTree: Automatic crack detection from pavement images;Zou;Pattern Recognit. Lett.,2012

2. Analysis of Edge-Detection Techniques for Crack Identification in Bridges;Abudayyeh;J. Comput. Civ. Eng.,2003

3. Sealed-Crack Detection Algorithm Using Heuristic Thresholding Approach;Kamaliardakani;J. Comput. Civ. Eng.,2016

4. FoSA: F* Seed-growing Approach for crack-line detection from pavement images;Li;Image Vis. Comput.,2011

5. Morphological segmentation and classification of underground pipe images;Sinha;Mach. Vis. Appl.,2006

Cited by 5 articles.

1. Recent advances in crack detection technologies for structures: a survey of 2022-2023 literature;Frontiers in Built Environment;2024-07-30

2. AI-based rock strength assessment from tunnel face images using hybrid neural networks;Scientific Reports;2024-07-30

3. ViT-Based Image Regression Model for Shear-Strength Prediction of Transparent Soil;Buildings;2024-04-01

4. Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open-Access Papers;Applied System Innovation;2024-01-22

5. Research on road damage recognition and classification based on improved VGG-19;Mathematical Models in Engineering;2023-10-06