1. CHABOT F, CHAOUCH M, RABARISOA J, et al. Deep MANTA: A coarse-to-fine many-task network for joint 2D and 3D vehicle analysis from monocular image [C]//IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 1827–1836.
2. KUNDU A, LI Y, REHG J M. 3D-RCNN: Instance-level 3D object reconstruction via render-and-compare [C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 3559–3568.
3. YOU Y R, WANG Y, CHAO W L, et al. Pseudo-LiDAR++: Accurate depth for 3D object detection in autonomous driving [EB/OL]. (2020-02-15) [2022-04-10]. https://arxiv.org/abs/1906.06310.
4. RODDICK T, KENDALL A, CIPOLLA R. Orthographic feature transform for monocular 3D object detection [EB/OL]. (2018-11-20) [2022-04-10]. https://arxiv.org/abs/1811.08188.
5. BRAZIL G, LIU X M. M3D-RPN: Monocular 3D region proposal network for object detection [C]//IEEE/CVF International Conference on Computer Vision (ICCV). Seoul: IEEE, 2019: 9286–9295.