Improving Multiple Pedestrian Tracking in Crowded Scenes with Hierarchical Association
Author:
Xiao Changcheng1ORCID, Luo Zhigang1
Affiliation:
1. School of Computer Science, National University of Defense Technology, Changsha 410000, China
Abstract
Recently, advances in detection and re-identification techniques have significantly boosted tracking-by-detection-based multi-pedestrian tracking (MPT) methods and made MPT a great success in most easy scenes. Several very recent works point out that the two-step scheme of first detection and then tracking is problematic and propose using the bounding box regression head of an object detector to realize data association. In this tracking-by-regression paradigm, the regressor directly predicts each pedestrian’s location in the current frame according to its previous position. However, when the scene is crowded and pedestrians are close to each other, the small and partially occluded targets are easily missed. In this paper, we follow this pattern and design a hierarchical association strategy to obtain better performance in crowded scenes. To be specific, at the first association, the regressor is used to estimate the positions of obvious pedestrians. At the second association, we employ a history-aware mask to filter out the already occupied regions implicitly and look carefully at the remaining regions to find out the ignored pedestrians during the first association. We integrate the hierarchical association in a learning framework and directly infer the occluded and small pedestrians in an end-to-end way. We conduct extensive pedestrian tracking experiments on three public pedestrian tracking benchmarks from less crowded to crowded scenes, demonstrating the proposed strategy’s effectiveness in crowded scenes.
Subject
General Physics and Astronomy
Reference60 articles.
1. Bewley, A., Ge, Z., Ott, L., Ramos, F., and Upcroft, B. (2016, January 25–28). Simple online and realtime tracking. Proceedings of the IEEE International Conference on Image Processing, Phoenix, AZ, USA. 2. Sun, S., Akhtar, N., Song, H., Mian, A., and Shah, M. (2018). Deep Affinity Network for Multiple Object Tracking. arXiv. 3. Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 7–12). Faster r-cnn: Towards real-time object detection with region proposal networks. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada. 4. Zhou, X., Koltun, V., and Krähenbühl, P. (2020). Tracking Objects as Points. arXiv. 5. Yang, K., Li, D., and Dou, Y. (November, January 27). Towards Precise End-to-End Weakly Supervised Object Detection Network. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|