Affiliation:
1. Dalian University of Technology, China
2. Shandong University of Science and Technology, China
3. The Chinese University of Hong Kong, Hong Kong
Abstract
Estimating the depth of transparent objects is one of the well-known challenges of RGB-D cameras due to the reflection and refraction effects. Previously, researchers propose to correct the depth of transparent objects by using their estimated segmentation masks, because it is possible to recover the internal depth of an object just from its boundary, as illustrated by those depth-from-silhouette methods. However, these algorithms only use segmentation masks. They ignore the internal structure information from the mask segmentation features, which we argue are more useful for transparent depth estimation. In this work, we demonstrate the effectiveness of segmentation features for transparent object depth estimation. We show that it is even possible to recover the depth map just from segmentation features, without any RGB or depth map as input. Based on this observation, we propose DualTransNet which uses segmentation features for transparent depth completion. In our DualTransNet, we feed segmentation features from an extra module to the main network for better depth completion quality. Extensive experiments have shown the superiority of segmentation features as well as the state-of-the-art performance of our network.
Publisher
Association for Computing Machinery (ACM)
Reference55 articles.
1. Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection
2. UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion
3. TODE-Trans: Transparent Object Depth Estimation with Transformer
4. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. 2017. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40, 4 (2017), 834–848.
5. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation