1. The KITTI Vision Benchmark Suite. https://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d. Accessed 03 July 2022
2. Alhaija, H., Mustikovela, S., Mescheder, L., Geiger, A., Rother, C.: Augmented reality meets computer vision: efficient data generation for urban driving scenes. IJCV (2018)
3. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
4. Brazil, G., Liu, X.: M3D-RPN: monocular 3D region proposal network for object detection. In: ICCV (2019)
5. Brazil, G., Pons-Moll, G., Liu, X., Schiele, B.: Kinematic 3D object detection in monocular video. In: ECCV (2020)