Depth Information Precise Completion-GAN: A Precisely Guided Method for Completing Ill Regions in Depth Maps
-
Published:2023-07-24
Issue:14
Volume:15
Page:3686
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Qian Ren1, Qiu Wenfeng1, Yang Wenbang1, Li Jianhua1, Wu Yun1, Feng Renyang2, Wang Xinan3, Zhao Yong13
Affiliation:
1. College of Computer Science and Technology, Guizhou University, Guiyang 550025, China 2. School of Information, Guizhou University of Finance and Economics, Guiyang 550031, China 3. School of Electronic and Computer Engineering, Shenzhen Graduate School of Peking University, Shenzhen 518055, China
Abstract
In the depth map obtained through binocular stereo matching, there are many ill regions due to reasons such as lighting or occlusion. These ill regions cannot be accurately obtained due to the lack of information required for matching. Since the completion model based on Gan generates random results, it cannot accurately complete the depth map. Therefore, it is necessary to accurately complete the depth map according to reality. To address this issue, this paper proposes a depth information precise completion GAN (DIPC-GAN) that effectively uses the Guid layer normalization (GuidLN) module to guide the model for precise completion by utilizing depth edges. GuidLN flexibly adjusts the weights of the guiding conditions based on intermediate results, allowing modules to accurately and effectively incorporate the guiding information. The model employs multiscale discriminators to discriminate results of different resolutions at different generator stages, enhancing the generator’s grasp of overall image and detail information. Additionally, this paper proposes Attention-ResBlock, which enables all ResBlocks in each task module of the GAN-based multitask model to focus on their own task by sharing a mask. Even when the ill regions are large, the model can effectively complement the missing details in these regions. Additionally, the multiscale discriminator in the model enhances the generator’s robustness. Finally, the proposed task-specific residual module can effectively focus different subnetworks of a multitask model on their respective tasks. The model has shown good repair results on datasets, including artificial, real, and remote sensing images. The final experimental results showed that the model’s REL and RMSE decreased by 9.3% and 9.7%, respectively, compared to RDFGan.
Funder
Science and Technology Planning of Shenzhen Technology Research and Development Fund National Natural Science Foundation of China Science and Technology Foundation of Guizhou Province
Subject
General Earth and Planetary Sciences
Reference38 articles.
1. Silberman, N., Hoiem, D., Kohli, P., and Fergus, R. (2012, January 7–13). Indoor segmentation and support inference from rgbd images. Proceedings of the European Conference on Computer Vision, Florence, Italy. 2. Chiu, Y.P., Leou, J.J., and Hsiao, H.H. (2014, January 1–5). Super-resolution reconstruction for kinect 3D data. Proceedings of the 2014 IEEE International Symposium on Circuits and Systems (ISCAS), Melbourne, VIC, Australia. 3. Ma, F., and Karaman, S. (2018, January 21–25). Sparse-to-dense: Depth prediction from sparse depth samples and a single image. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia. 4. Dumoulin, V., Shlens, J., and Kudlur, M. (2016). A learned representation for artistic style. arXiv. 5. Learning topology from synthetic data for unsupervised depth completion;Wong;IEEE Robot. Autom. Lett.,2021
|
|