Text Detection and Recognition for X-ray Weld Seam Images

Published: 2024-03-13, Volume: 14, Issue: 6, Page: 2422
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en

Author:

Zheng Qihang¹, Zhang Yaping¹

Affiliation:

1. School of Information, Yunnan Normal University, Kunming 650500, China

Abstract

X-ray weld seam images carry vital information about welds. Applying graphic–text recognition technology to these images enables intelligent data collection in complex industrial environments and promises significant gains in work efficiency. This study uses deep learning methods to improve the accuracy and efficiency of extracting weld seam information. We first collected a dataset of X-ray weld seam images for model training and evaluation. The study comprises two main components: text detection and text recognition. For text detection, we employed a model based on the DBNet algorithm and tailored the post-processing to the characteristics of weld seam images. The trained model detects text regions efficiently, with 91% precision, 92.4% recall, and a 91.7% F1 score on the test dataset. For text recognition, we introduced modules such as CA, CBAM, and HFA to capture character position information and global text features effectively. With these modules, text-line recognition accuracy reached 93.4%. In conclusion, our study provides an efficient deep learning solution for text detection and recognition in X-ray weld seam images, offering robust support for weld seam information collection in industrial manufacturing.
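
The detection figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes the F1 score is the usual harmonic mean of precision and recall (the abstract does not spell out the definition); the function name `f1_score` is illustrative and not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Text detection figures reported in the abstract.
precision, recall = 0.910, 0.924
f1 = f1_score(precision, recall)
print(f"F1 = {f1:.1%}")  # prints "F1 = 91.7%"
```

Under that assumption, the reported 91% precision and 92.4% recall are consistent with the reported 91.7% F1 score.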

Funder

Yunnan Provincial Agricultural Basic Research Joint Special Project

Yunnan Ten-Thousand Talents Program

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/6/2422/pdf

References: 33 articles.

1. Tian, Z., Huang, W., He, T., He, P., and Qiao, Y. (2016). Detecting text in natural image with connectionist text proposal network. Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016, Proceedings, Part VIII, Springer.

2. He, K., Gkioxari, G., Dollár, P., and Girshick, R. (2017, January 22–29). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.

3. Wang, W., Xie, E., Li, X., Hou, W., Lu, T., Yu, G., and Shao, S. (2019, January 15–20). Shape robust text detection with progressive scale expansion network. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.

4. Zhu, Y., Chen, J., Liang, L., Kuang, Z., Jin, L., and Zhang, W. (2021, January 20–25). Fourier contour embedding for arbitrary-shaped text detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.

5. Liao, M., Wan, Z., Yao, C., Chen, K., and Bai, X. (2020, January 7–12). Real-time scene text detection with differentiable binarization. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.