Depth-Enhanced Deep Learning Approach For Monocular Camera Based 3D Object Detection-Reference-Cited by-同舟云学术

Depth-Enhanced Deep Learning Approach For Monocular Camera Based 3D Object Detection

Published:2024-07-09 Issue:3 Volume:110 Page:
ISSN:1573-0409
Container-title:Journal of Intelligent & Robotic Systems
language:en
Short-container-title:J Intell Robot Syst

Author:

Wang Chuyao^ORCID,Aouf Nabil

Abstract

AbstractAutomatic 3D object detection using monocular cameras presents significant challenges in the context of autonomous driving. Precise labeling of 3D object scales requires accurate spatial information, which is difficult to obtain from a single image due to the inherent lack of depth information in monocular images, compared to LiDAR data. In this paper, we propose a novel approach to address this issue by enhancing deep neural networks with depth information for monocular 3D object detection. The proposed method comprises three key components: 1)Feature Enhancement Pyramid Module: We extend the conventional Feature Pyramid Networks (FPN) by introducing a feature enhancement pyramid network. This module fuses feature maps from the original pyramid and captures contextual correlations across multiple scales. To increase the connectivity between low-level and high-level features, additional pathways are incorporated. 2)Auxiliary Dense Depth Estimator: We introduce an auxiliary dense depth estimator that generates dense depth maps to enhance the spatial perception capabilities of the deep network model without adding computational burden. 3)Augmented Center Depth Regression: To aid center depth estimation, we employ additional bounding box vertex depth regression based on geometry. Our experimental results demonstrate the superiority of the proposed technique over existing competitive methods reported in the literature. The approach showcases remarkable performance improvements in monocular 3D object detection, making it a promising solution for autonomous driving applications.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10846-024-02128-w.pdf

Reference58 articles.

1. He, L., Aouf, N., Whidborne, J.F., et al.: Integrated moment-based LGMD and deep reinforcement learning for UAV obstacle avoidance. 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 7491-7497, IEEE, (2020)

2. Shah, M.A., Aouf, N.: 3d cooperative pythagorean hodograph path planning and obstacle avoidance for multiple uavs. 2010 IEEE 9th International Conference on Cyberntic Intelligent Systems, pp. 1-6, IEEE, (2010)

3. Kanchwala, H., Bezerra Viana, I., Aouf, N.: Cooperative path-planning and tracking controller evaluation using vehicle models of varying complexities. Proc. Inst. Mech. Eng. Pt. C J. Mechan. Eng. Sci. 235(16), 2877–2896 (2021)

4. Wang, C., Aouf, N.: Explainable deep adversarial reinforcement learning approach for robust autonomous driving. IEEE Trans. Intell. Veh. (2024)

5. Girshick, R.: Fast r-cnn. Proceedings of the IEEE international conference on computer vision, pp. 1440-1448 (2015)