Author:
Zhang Junjie,Zhang Yuchen,Liu Jindong,Lan Yuxuan,Zhang Tianxiang
Abstract
AbstractThe unearthed Han Dynasty portrait stones are an important part of China’s ancient artistic heritage, and detecting human images in these stones is a critical prerequisite for studying their artistic value. However, high-precision target detection techniques often result in a large number of parameters, making them unsuitable for portable devices. In this work, we propose a new human image target detection model based on an enhanced YOLO-v5. We discovered that the complex backgrounds, dense group targets, and significant scale variations of targets within large scenes in portrait stones present significant challenges for human target image detection. Therefore, we first incorporated the SPD-Conv convolution and Coordinate Attention self-attention mechanism modules into the YOLO-v5 architecture, aiming to enhance the model’s recognition precision for small target images within Han portrait stones and strengthen its resistance to background disturbances. Moreover, we introduce DIoU NMS and Alpha-IoU Loss to improve the detector’s performance in dense target scenarios, reducing the omission of densely packed objects. Finally, the experimental results from our collected dataset of Han Dynasty stone figure images demonstrate that our method achieves fast convergence and high recognition accuracy. This approach can be better applied to the target detection tasks of special character images in complex backgrounds.
Publisher
Springer Science and Business Media LLC
Reference27 articles.
1. Chang K.-c. Early chinese civilization 1976;23.
2. Ebrey P. Later han stone inscriptions. Harvard J Asiatic Stud. 1980. 40:325–53.
3. Zou Z, Chen K, Shi Z, Guo Y, Ye J. Object detection in 20 years: a survey. Proc IEEE. 2023. 111:257–76.
4. Li Q, Chen Y, Zeng Y. Transformer with transfer cnn for remote-sensing-image object detection. Remote Sensing. 2022. 14:984.
5. Girshick R, Donahue J, Darrell T, Malik. Jitendra, rich feature hierarchies for accurate object detection and semantic segmentation. Proc IEEE Computer Vision Pattern Recogn. 2014. 98:580–7.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献