Abstract
Aiming at the problem that the SSD algorithm does not fully extract the feature information contained in each feature layer, as well as the feature information is easily lost during the sampling process, which makes the feature expression ineffective and leads to insufficient performance in small target detection. In this paper, AMT-SSD is proposed, a small target detection algorithm that incorporates the multi-branch stacking and new sampling transition module of the attention mechanism. In this algorithm, the composite attention mechanism is utilized to improve the correlation of features of the samples to be detected in terms of spatial and channels, and the efficiency of the algorithm; secondly, multi-branch stacking module is used to extract multi-size features for each feature layer, and different sizes of convolution kernels are utilized in parallel to fully extract their features and improve the expression of features; meanwhile, during the sampling process, the problem of missing features is solved by applying inverse subpixel convolution in the new sampling transition module. Experimentally, the AMT-SSD algorithm achieves 84.6% and 53.4% mAP metrics on the PASCAL VOC dataset and MS COCO dataset, respectively. This indicates that the AMT-SSD algorithm can effectively extract feature information that is beneficial to detection samples, and also performs well in reducing feature loss, which is effective for the algorithm to improve the algorithm on small targets.
Funder
Jiangsu Graduate Practical Innovation Project
Major Project of Philosophy and Social Science Research in Colleges and Universities of Jiangsu Province
Natural Science Foundation of China under Grant
Natural Science Research Project of Jiangsu University
Publisher
Public Library of Science (PLoS)
Reference41 articles.
1. Performance releaser with smart anchor learning for arbitrary‐oriented object detection;T W Zhang;CAAI Transactions on Intelligence Technology,2023
2. IFODPSO-based multi-level image segmentation scheme aided with Masi entropy;R Chakraborty;Journal of Ambient Intelligence and Humanized Computing,2021
3. Needle detection and localisation for robot‐assisted subretinal injection using deep learning;M Zhou;CAAI Transactions on Intelligence Technology,2023
4. QEST: Quantized and efficient scene text detector using deep learning;K Manjari;ACM Transactions on Asian and Low-Resource Language Information Processing,2023
5. Spatial pyramid pooling in deep convolutional networks for visual recognition;K He;IEEE transactions on pattern analysis and machine intelligence,2015