Affiliation:
1. School of Information Technology, Yancheng Institute of Technology, Yancheng 224051, China
2. Yancheng Xiongying Precision Machinery Company Limited, Yancheng 224006, China
Abstract
To address the limited semantic information and low detection accuracy of the SSD (single shot multibox detector) algorithm on small targets, this paper proposes MPH-SSD (multiscale pyramid hybrid SSD), an algorithm that integrates an attention mechanism with multiscale double pyramid feature enhancement. First, the SSD algorithm extracts the feature maps of small targets, and a shallow feature enhancement module expands the receptive field of the shallow feature layer, enriching its semantic information for small targets and improving the expressive power of shallow features. The processed shallow feature layer and the deep feature layer are then fused at multiple scales, combining semantic information and location information into an information-rich feature map. Second, a cascaded double pyramid structure passes information from the deep layers to the shallow layers, so that context information is transferred effectively between feature layers and the feature information is further strengthened. A hybrid attention mechanism retains more context information in the network, adaptively adjusts the feature map obtained by additive fusion, and reduces background interference. Experiments on the Pascal VOC and MS COCO datasets show that MPH-SSD achieves a mean average precision (mAP) of 87.7% and 51.1%, respectively. The results show that MPH-SSD makes better use of the feature information in the shallow feature layer during small target detection and achieves better detection performance on small targets.
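The abstract does not give the internals of the fusion or attention modules, so the following is only a minimal sketch of the general idea: a deep feature map is upsampled, added to a shallow one, and the sum is re-weighted by a channel gate followed by a spatial gate. Everything here is an assumption made for illustration: the function names are hypothetical, upsampling is nearest-neighbour, the gates have no learned weights, and plain Python lists stand in for tensors to keep the example dependency-free. It is not the MPH-SSD implementation.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a [C][H][W] feature map (assumed scheme)."""
    return [
        [[row[x // 2] for x in range(2 * len(row))] for row in ch for _ in range(2)]
        for ch in fmap
    ]

def fuse_add(shallow, deep):
    """Top-down fusion: upsample the deep map and add it element-wise to the shallow map."""
    up = upsample2x(deep)
    return [
        [[s + d for s, d in zip(srow, drow)] for srow, drow in zip(sch, dch)]
        for sch, dch in zip(shallow, up)
    ]

def channel_attention(fmap):
    """Scale each channel by the sigmoid of its global average (no learned weights)."""
    out = []
    for ch in fmap:
        n = len(ch) * len(ch[0])
        gate = sigmoid(sum(sum(row) for row in ch) / n)
        out.append([[v * gate for v in row] for row in ch])
    return out

def spatial_attention(fmap):
    """Scale each position by the sigmoid of its mean over channels (no learned weights)."""
    c, h, w = len(fmap), len(fmap[0]), len(fmap[0][0])
    gate = [
        [sigmoid(sum(fmap[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]
    return [
        [[fmap[k][i][j] * gate[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]

def hybrid_attention(fmap):
    """Channel gate followed by spatial gate, a common ordering assumed here."""
    return spatial_attention(channel_attention(fmap))

if __name__ == "__main__":
    # Toy inputs: a 2-channel 4x4 shallow map and a 2-channel 2x2 deep map.
    shallow = [[[0.1] * 4 for _ in range(4)] for _ in range(2)]
    deep = [[[1.0, 2.0], [3.0, 4.0]], [[0.5, 0.5], [0.5, 0.5]]]
    refined = hybrid_attention(fuse_add(shallow, deep))
    print(len(refined), len(refined[0]), len(refined[0][0]))
```

In a real detector the gates would be small learned layers and the maps would be framework tensors; the sketch only shows the order of operations, fusion first and attention on the fused map.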
Funder
Jiangsu Graduate Practical Innovation Project
Subject
General Mathematics, General Medicine, General Neuroscience, General Computer Science
Cited by
9 articles.