Deep Monocular Depth Estimation Based on Content and Contextual Features-Reference-Cited by-同舟云学术

Deep Monocular Depth Estimation Based on Content and Contextual Features

Published:2023-03-08 Issue:6 Volume:23 Page:2919
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Abdulwahab Saddam¹^ORCID,Rashwan Hatem A.¹^ORCID,Sharaf Najwa¹,Khalid Saif¹^ORCID,Puig Domenec¹

Affiliation:

1. Department of Computer Engineering and Mathematics, Universitat Rovira i Virgil, Campus Sescelades, Avinguda dels Paisos Catalans, 26, 43007 Tarragona, Spain

Abstract

Recently, significant progress has been achieved in developing deep learning-based approaches for estimating depth maps from monocular images. However, many existing methods rely on content and structure information extracted from RGB photographs, which often results in inaccurate depth estimation, particularly for regions with low texture or occlusions. To overcome these limitations, we propose a novel method that exploits contextual semantic information to predict precise depth maps from monocular images. Our approach leverages a deep autoencoder network incorporating high-quality semantic features from the state-of-the-art HRNet-v2 semantic segmentation model. By feeding the autoencoder network with these features, our method can effectively preserve the discontinuities of the depth images and enhance monocular depth estimation. Specifically, we exploit the semantic features related to the localization and boundaries of the objects in the image to improve the accuracy and robustness of the depth estimation. To validate the effectiveness of our approach, we tested our model on two publicly available datasets, NYU Depth v2 and SUN RGB-D. Our method outperformed several state-of-the-art monocular depth estimation techniques, achieving an accuracy of 85%, while minimizing the error Rel by 0.12, RMS by 0.523, and log10 by 0.0527. Our approach also demonstrated exceptional performance in preserving object boundaries and faithfully detecting small object structures in the scene.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/6/2919/pdf

Reference38 articles.

1. Simões, F., Almeida, M., Pinheiro, M., Dos Anjos, R., Dos Santos, A., Roberto, R., Teichrieb, V., Suetsugo, C., and Pelinson, A. (2012, January 28–31). Challenges in 3d reconstruction from images for difficult large-scale objects: A study on the modeling of electrical substations. Proceedings of the 2012 14th Symposium on Virtual and Augmented Reality, Rio de Janeiro, Brazil.

2. Adversarial Learning for Depth and Viewpoint Estimation From a Single Image;Abdulwahab;IEEE Trans. Circuits Syst. Video Technol.,2020

3. Monocular depth map estimation based on a multi-scale deep architecture and curvilinear saliency feature boosting;Abdulwahab;Neural Comput. Appl.,2022

4. Semantic understanding of scenes through the ade20k dataset;Zhou;Int. J. Comput. Vis.,2018

5. Hu, J., Shen, L., and Sun, G. (2018, January 18–22). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing depth estimation with attention U-Net;International Journal of System Assurance Engineering and Management;2024-07-20

2. Error Compensation of Inkjet-printed Electronics using Incremental Learning and Knowledge Distillation;2024 International Conference on Artificial Intelligence, Computer, Data Sciences and Applications (ACDSA);2024-02-01

3. Semantic Segmentation and Depth Estimation Based on Residual Attention Mechanism;Sensors;2023-08-28