Abstract
Multi-modal depth estimation is one of the key challenges in endowing autonomous machines with robust robotic perception capabilities. There have been outstanding advances in uni-modal depth estimation based on either monocular cameras, owing to the rich, high-resolution appearance information they capture, or LiDAR sensors, owing to the precise geometric data they provide. However, each modality suffers from inherent drawbacks, such as the cameras' high sensitivity to changes in illumination and the LiDARs' limited resolution. Sensor fusion can combine the merits and compensate for the downsides of these two kinds of sensors. Nevertheless, current fusion methods operate at a high level: they process the sensor data streams independently and combine the high-level estimates obtained for each sensor. In this paper, we tackle the problem at a low level, fusing the raw sensor streams to obtain depth estimates that are both dense and precise and can serve as a unified multi-modal data source for higher-level estimation problems. This work proposes a conditional random field (CRF) model with multiple geometry and appearance potentials that seamlessly represents the problem of estimating dense depth maps from camera and LiDAR data. The model can be optimized efficiently using the conjugate gradient squared (CGS) algorithm. The proposed method was evaluated and compared with the state of the art on the widely used KITTI benchmark dataset.
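The abstract does not spell out the exact potentials, so the following is only a minimal, illustrative sketch of the general recipe it describes: a quadratic (Gaussian) CRF whose unary terms anchor the depth map to sparse LiDAR returns and whose pairwise terms impose appearance-weighted smoothness from the camera image; the MAP estimate of such a model reduces to a sparse linear system, which is handed to SciPy's conjugate gradient squared solver. The energy form, the 4-neighborhood, and the weights `w` and `sigma` are assumptions for this sketch, not the paper's actual formulation.

```python
# Illustrative Gaussian-CRF depth fusion (assumed energy, not the paper's exact model):
#   E(d) = sum_i m_i (d_i - z_i)^2  +  w * sum_{(i,j)} a_ij (d_i - d_j)^2
# Its minimizer solves the sparse system (M + w*L) d = M z, solved here with CGS.
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import cgs

H, W = 64, 64                        # toy image size
n = H * W
rng = np.random.default_rng(0)

z = np.zeros(n)                      # sparse LiDAR depths (zero = no return)
mask = rng.random(n) < 0.05          # ~5% of pixels carry a LiDAR measurement
z[mask] = rng.uniform(2.0, 30.0, mask.sum())

gray = rng.random((H, W))            # stand-in for the camera intensity image

# Appearance-weighted 4-neighbor smoothness (the "appearance potential"):
rows, cols, vals = [], [], []
def add_edge(i, j, a):
    # Graph-Laplacian contribution of edge (i, j) with weight a.
    rows.extend([i, j, i, j]); cols.extend([i, j, j, i])
    vals.extend([a, a, -a, -a])

sigma = 0.1                          # assumed appearance bandwidth
for y in range(H):
    for x in range(W):
        i = y * W + x
        if x + 1 < W:                # right neighbor
            a = np.exp(-((gray[y, x] - gray[y, x + 1]) ** 2) / (2 * sigma ** 2))
            add_edge(i, i + 1, a)
        if y + 1 < H:                # bottom neighbor
            a = np.exp(-((gray[y, x] - gray[y + 1, x]) ** 2) / (2 * sigma ** 2))
            add_edge(i, i + W, a)

L = sp.csr_matrix((vals, (rows, cols)), shape=(n, n))
M = sp.diags(mask.astype(float))     # unary (geometry) potential at LiDAR hits

w = 0.5                              # assumed smoothness weight
A = M + w * L                        # sparse, symmetric positive definite here
d, info = cgs(A, M @ z)              # dense MAP depth map; info == 0 on success
print("converged:", info == 0, "depth range:", d.min(), d.max())
```

In this quadratic setting CGS is one of several applicable Krylov solvers; it only needs matrix-vector products with the sparse system matrix, which keeps the per-iteration cost linear in the number of pixels.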
Funder
Universidad Autónoma de Occidente, Cali, Colombia
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications, Computer Vision and Pattern Recognition, Hardware and Architecture, Software
Cited by
2 articles.