Affiliation:
1. National Key Laboratory of Fundamental Science on Synthetic Vision, Sichuan University, Chengdu 610065, China
2. Independent Researcher, Chengdu 610095, China
Abstract
Models based on joint detection and re-identification (ReID), which significantly increase the efficiency of online multi-object tracking (MOT) systems, are an evolution from separate detection and ReID models in the tracking-by-detection (TBD) paradigm. It is observed that these joint models are typically one-stage, while the two-stage models become obsolete because of their slow speed and low efficiency. However, the two-stage models have naive advantages over the one-stage anchor-based and anchor-free models in handling feature misalignment and occlusion, which suggests that the two-stage models, via meticulous design, could be on par with the state-of-the-art one-stage models. Following this intuition, we propose a robust and efficient two-stage joint model based on R–FCN, whose backbone and neck are fully convolutional, and the RoI-wise process only involves simple calculations. In the first stage, an adaptive sparse anchoring scheme is utilized to produce adequate, high-quality proposals to improve efficiency. To boost both detection and ReID, two key elements—feature aggregation and feature disentanglement—are taken into account. To improve robustness against occlusion, the position-sensitivity is exploited, first to estimate occlusion and then to direct the post-process for anti-occlusion. Finally, we link the model to a hierarchical association algorithm to form a complete MOT system called PSMOT. Compared to other cutting-edge systems, PSMOT achieves competitive performance while maintaining time efficiency.
Funder
Key R&D Projects in Sichuan Province, China
Reference66 articles.
1. Motion Planning for Autonomous Driving: The State of the Art and Future Perspectives;Teng;IEEE Trans. Intell. Veh.,2023
2. Varghese, E.B., and Thampi, S.M. (2023). Intelligent Image and Video Analytics, Routledge.
3. Deep Learning-based Visual Multiple Object Tracking: A Review;Wu;Comput. Sci.,2023
4. Voigtlaender, P., Krause, M., Osep, A., Luiten, J., Sekar, B.B.G., Geiger, A., and Leibe, B. (2019, January 15–20). Mots: Multi-object tracking and segmentation. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.
5. Wang, Z., Zheng, L., Liu, Y., Li, Y., and Wang, S. (2020, January 23–28). Towards real-time multi-object tracking. Proceedings of the Computer Vision-ECCV2020: European Conference, Glasgow, UK.