Depth Information Precise Completion-GAN: A Precisely Guided Method for Completing Ill Regions in Depth Maps-Reference-Cited by-同舟云学术

Depth Information Precise Completion-GAN: A Precisely Guided Method for Completing Ill Regions in Depth Maps

Published:2023-07-24 Issue:14 Volume:15 Page:3686
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Qian Ren¹,Qiu Wenfeng¹,Yang Wenbang¹,Li Jianhua¹,Wu Yun¹,Feng Renyang²,Wang Xinan³,Zhao Yong¹³

Affiliation:

1. College of Computer Science and Technology, Guizhou University, Guiyang 550025, China

2. School of Information, Guizhou University of Finance and Economics, Guiyang 550031, China

3. School of Electronic and Computer Engineering, Shenzhen Graduate School of Peking University, Shenzhen 518055, China

Abstract

In the depth map obtained through binocular stereo matching, there are many ill regions due to reasons such as lighting or occlusion. These ill regions cannot be accurately obtained due to the lack of information required for matching. Since the completion model based on Gan generates random results, it cannot accurately complete the depth map. Therefore, it is necessary to accurately complete the depth map according to reality. To address this issue, this paper proposes a depth information precise completion GAN (DIPC-GAN) that effectively uses the Guid layer normalization (GuidLN) module to guide the model for precise completion by utilizing depth edges. GuidLN flexibly adjusts the weights of the guiding conditions based on intermediate results, allowing modules to accurately and effectively incorporate the guiding information. The model employs multiscale discriminators to discriminate results of different resolutions at different generator stages, enhancing the generator’s grasp of overall image and detail information. Additionally, this paper proposes Attention-ResBlock, which enables all ResBlocks in each task module of the GAN-based multitask model to focus on their own task by sharing a mask. Even when the ill regions are large, the model can effectively complement the missing details in these regions. Additionally, the multiscale discriminator in the model enhances the generator’s robustness. Finally, the proposed task-specific residual module can effectively focus different subnetworks of a multitask model on their respective tasks. The model has shown good repair results on datasets, including artificial, real, and remote sensing images. The final experimental results showed that the model’s REL and RMSE decreased by 9.3% and 9.7%, respectively, compared to RDFGan.

Funder

Science and Technology Planning of Shenzhen

Technology Research and Development Fund

National Natural Science Foundation of China

Science and Technology Foundation of Guizhou Province

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/14/3686/pdf

Reference38 articles.

1. Silberman, N., Hoiem, D., Kohli, P., and Fergus, R. (2012, January 7–13). Indoor segmentation and support inference from rgbd images. Proceedings of the European Conference on Computer Vision, Florence, Italy.

2. Chiu, Y.P., Leou, J.J., and Hsiao, H.H. (2014, January 1–5). Super-resolution reconstruction for kinect 3D data. Proceedings of the 2014 IEEE International Symposium on Circuits and Systems (ISCAS), Melbourne, VIC, Australia.

3. Ma, F., and Karaman, S. (2018, January 21–25). Sparse-to-dense: Depth prediction from sparse depth samples and a single image. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia.

4. Dumoulin, V., Shlens, J., and Kudlur, M. (2016). A learned representation for artistic style. arXiv.

5. Learning topology from synthetic data for unsupervised depth completion;Wong;IEEE Robot. Autom. Lett.,2021