Deep learning framework with Local Sparse Transformer for construction worker detection in 3D with LiDAR-Reference-Cited by-同舟云学术

Deep learning framework with Local Sparse Transformer for construction worker detection in 3D with LiDAR

Published:2024-05-26 Issue:19 Volume:39 Page:2990-3007
ISSN:1093-9687
Container-title:Computer-Aided Civil and Infrastructure Engineering
language:en
Short-container-title:Computer aided Civil Eng

Author:

Zhang Mingyu¹,Wang Lei¹,Han Shuai¹,Wang Shuyuan¹,Li Heng¹

Affiliation:

1. Department of Building and Real Estate Hong Kong Polytechnic University Hong Kong China

Abstract

AbstractAutonomous equipment is playing an increasingly important role in construction tasks. It is essential to equip autonomous equipment with powerful 3D detection capability to avoid accidents and inefficiency. However, there is limited research within the construction field that has extended detection to 3D. To this end, this study develops a light detection and ranging (LiDAR)‐based deep‐learning model for the 3D detection of workers on construction sites. The proposed model adopts a voxel‐based anchor‐free 3D object detection paradigm. To enhance the feature extraction capability for tough detection tasks, a novel Transformer‐based block is proposed, where the multi‐head self‐attention is applied in local grid regions. The detection model integrates the Transformer blocks with 3D sparse convolution to extract wide and local features while pruning redundant features in modified downsampling layers. To train and test the proposed model, a LiDAR point cloud dataset was created, which includes workers in construction sites with 3D box annotations. The experiment results indicate that the proposed model outperforms the baseline models with higher mean average precision and smaller regression errors. The method in the study is promising to provide worker detection with rich and accurate 3D information required by construction automation.

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/mice.13238

Reference60 articles.

1. Allinson M.(2022).Construction robotics startup Canvas launches drywall finishing robot. Robotics and Automation News.https://roboticsandautomationnews.com/2022/01/27/construction‐robotics‐startup‐canvas‐launches‐drywall‐finishing‐robot/48705/

2. Real Time Apnoea Monitoring of Children Using the Microsoft Kinect Sensor: A Pilot Study

3. Beltrán J. Guindel C. Moreno F. M. Cruzado D. García F. &De La Escalera A.(2018).BirdNet: A 3D object detection framework from LiDAR information.2018 21st International Conference on Intelligent Transportation Systems (ITSC) Maui HI (pp.3517–3523).https://doi.org/10.1109/ITSC.2018.8569311

4. Business Research. (2023).Autonomous construction equipment market size trends and global forecast To 2032. The Business Research Company.https://www.thebusinessresearchcompany.com/report/autonomous‐construction‐equipment‐global‐market‐report

5. Caesar H. Bankiti V. Lang A. H. Vora S. Liong V. E. Xu Q. Krishnan A. Pan Y. Baldan G. &Beijbom O.(2020).nuScenes: A multimodal dataset for autonomous driving.2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Seattle WA (pp.11618–11628).https://doi.org/10.1109/CVPR42600.2020.01164