1. Luo, W., Yang, B., & Urtasun, R. (2018). Fast and furious: real time end-to-end 3D detection, tracking and motion forecasting with a single convolutional net. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3569–3577). Piscataway: IEEE.
2. Li, P., & Jin, J. (2022). Time3D: end-to-end joint monocular 3D object detection and tracking for autonomous driving. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3875–3884). Piscataway: IEEE.
3. Luo, C., Yang, X., & Yuille, A. L. (2021). Exploring simple 3D multi-object tracking for autonomous driving. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 10468–10477). Piscataway: IEEE.
4. Wilson, J., & Lin, M. C. (2020). AVOT: audio-visual object tracking of multiple objects for robotics. In Proceedings of the IEEE international conference on robotics and automation (pp. 10045–10051). Piscataway: IEEE.
5. Xu, T., Zhu, X.-F., & Wu, X.-J. (2023). Learning spatio-temporal discriminative model for affine subspace based visual object tracking. Visual Intelligence, 1(1), 4.