Affiliation:
1. College of Electron and Information, University of Electronic Science and Technology of China, Zhongshan Institute, Zhongshan 528402, China
2. Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin 150080, China
Abstract
In response to the problems of complex background, multi-scale dangerous goods and severe stacking in X-ray security images, this paper proposes a high-accuracy dangerous goods detection algorithm for X-ray security images based on the improvement of YOLOv7. Firstly, by combining the coordinate attention mechanism, the downsampling structure of the backbone network is improved to enhance the model’s target feature localization ability. Secondly, a weighted bidirectional feature pyramid network is used as the feature fusion structure to achieve multi-scale feature weighted fusion and further simplify the network. Then, combined with dynamic snake convolution, a downsampling structure was designed to facilitate the extraction of features at different scales, providing richer feature representations. Finally, drawing inspiration from the idea of group convolution and combining it with Conv2Former, a feature extraction module called a multi-convolution transformer (MCT) was designed to enhance the network’s feature extraction ability by combining multi-scale information. The improved YOLOv7 in this article was tested on the public datasets SIXRay, CLCXray, and PIDray. The average detection accuracy (mAP) of the improved model was 96.3%, 79.3%, and 84.7%, respectively, which was 4.7%, 2.7%, and 3.1% higher than YOLOv7. This proves the effectiveness and universality of the method proposed in this article. Compared to the current mainstream X-ray image dangerous goods detection models, this model effectively reduces the false detection rate of dangerous goods in X-ray security inspection images and has achieved significant improvement in the detection of small and multi-scale targets, achieving higher accuracy in dangerous goods detection.
Funder
the Key Research and Development Plan Project of Heilongjiang
Reference38 articles.
1. Security Inspection Image Detection and Application Based on Deep Learning;Wang;Mod. Inf. Technol.,2021
2. Han, B.-M., Xi, Z., and Sun, Y.-J. (2022). Summary of 2022 World Urban Rail Transit Operation Statistics and Analysis, Urban Rapid Rail Transit.
3. Bastan, M., Yousefi, M.R., and Breuel, T.M. (2011, January 29–31). Visual words on baggage X-ray images. Proceedings of the 2011 International Conference on Computer Analysis of Images and Patterns, Seville, Spain.
4. Turcsany, D., Mouton, A., and Breckon, T.P. (2013, January 25–28). Improving feature-based object recognition for X-ray baggage security screening using primed visualwords. Proceedings of the 2013 International Conference on Industrial Technology, Cape Town, South Africa.
5. Flitton, G., Mouton, A., and Breckon, T.P. (2015). Object Classification in 3D Baggage Security Computed Tomography Imagery Using Visual Codebooks, Elsevier Science Inc.