SPCS: a spatial pyramid convolutional shuffle module for YOLO to detect occluded object-Reference-Cited by-同舟云学术

SPCS: a spatial pyramid convolutional shuffle module for YOLO to detect occluded object

Published:2022-06-29 Issue:1 Volume:9 Page:301-315
ISSN:2199-4536
Container-title:Complex & Intelligent Systems
language:en
Short-container-title:Complex Intell. Syst.

Author:

Li Xiang,He Miao,Liu Yan,Luo Haibo^ORCID,Ju Moran

Abstract

AbstractIn crowded scenes, one of the most important issues is that heavily overlapped objects are hardly distinguished from each other since most of their pixels are shared and the visible pixels of the occluded objects, which are used to represent their features, are limited. In this paper, a spatial pyramid convolutional shuffle (SPCS) module is proposed to extract refined information from the limited visible pixels of the occluded objects and generate distinguishable representations for the heavily overlapped objects. We adopt four convolutional kernels with different sizes and dilation rates at each location in the pyramid features and adjacently recombine their fused outputs spatially using a pixel shuffle module. In this way, four distinguishable instance predictions corresponding different convolutional kernels can be produced for each location in the pyramid feature. In addition, multiple convolutional operations with different kernel sizes and dilation rates at the same location can generate refined information for the corresponding regions, which is helpful to extract features for the occluded objects from their limited visible pixels. Extensive experimental results demonstrate that SPCS module can effectively boost the performance in crowded human detection. YOLO detector with SPCS module achieves 94.11% AP, 41.75% MR, 97.75% Recall on CrowdHuman, 93.04% AP, and 98.45% Recall on WiderPerson, which are the best compared with previous state-of-the-art models.

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s40747-022-00786-7.pdf

Reference47 articles.

1. Yang Y, Tang X, Cheung Y-M, Zhang X, Liu F, Ma J, Jiao L (2022) Ar²det: An accurate and real-time rotational one-stage ship detector in remote sensing images. IEEE Trans Geosci Remote Sens 60:1–14. https://doi.org/10.1109/TGRS.2021.3092433

2. Ma W, Li N, Zhu H, Jiao L, Tang X, Guo Y, Hou B (2022) Feature split–merge–enhancement network for remote sensing object detection. IEEE Trans Geosci Remote Sens 60:1–17. https://doi.org/10.1109/TGRS.2022.3140856

3. Chen N, Li M, Yuan H, Su X, Li Y (2021) Survey of pedestrian detection with occlusion. Complex Intell Syst 7:577–587. https://doi.org/10.1007/s40747-020-00206-8

4. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

5. Redmon J, Farhadi A (2017) Yolo9000: Better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SMR–YOLO: Multi-Scale Detection of Concealed Suspicious Objects in Terahertz Images;Photonics;2024-08-22

2. Object Part Appearance Module built into Yolo for Occlusion;2024 8th International Conference on Image and Signal Processing and their Applications (ISPA);2024-04-21

3. Personal protective equipment detection using YOLOv8 architecture on object detection benchmark datasets: a comparative study;Cogent Engineering;2024-04-12

4. A small object detection algorithm based on feature interaction and guided learning;Journal of Visual Communication and Image Representation;2024-02

5. A Human Posture Estimation Method for Image Interaction System Based on ECA;Communications in Computer and Information Science;2024