Reconstructing the local structures of Chinese ancient architecture using unsupervised depth estimation-Reference-Cited by-同舟云学术

Reconstructing the local structures of Chinese ancient architecture using unsupervised depth estimation

Published:2024-08-30 Issue:1 Volume:12 Page:
ISSN:2050-7445
Container-title:Heritage Science
language:en
Short-container-title:Herit Sci

Author:

Yao Xiaoling,Hu Lihua,Zhang Jifu

Abstract

AbstractDigitalization of ancient architectures is one of the effective means for the preservation of heritage structures, with 3D reconstruction based on computer vision being a key component of such digitalization techniques. However, Chinese ancient architectures are located in mountainous areas, and existing 3D reconstruction methods fall short in restoring the local structures of these architectures. This paper proposes a self-attention-guided unsupervised single image-based depth estimation method, providing innovative technical support for the reconstruction of local structures in Chinese ancient architectures. First, an attention module is constructed based on features extracted from architectural images learned by the encoder, and then embedded into the encoder-decoder to capture the interdependencies across local features. Second, a disparity map is generated using the loss constraint network, including reconstruction matching, smoothness of the disparity, and left-right disparity consistency. Third, an unsupervised architecture based on binocular image pairs is constructed to remove any potential adverse effects due to unknown scale or estimated pose errors. Finally, with the known baseline distance and camera focal length, the disparity map is converted into the depth map to perform the end-to-end depth estimation from a single image. Experiments on the our architecture dataset validates our method, and it performs well also well on KITTI.

Funder

National Natural Science Foundation of China

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s40494-024-01433-9.pdf

Reference32 articles.

1. Liu X, Liu Y, Wang K, Zhang Y, Lei Y, An H, Wang M, Chen Y. A color prediction model for mending materials of the Yuquan Iron Pagoda in China based on machine learning. Herit Sci. 2024;12(1):183.

2. Ming Y, Meng X, Fan C, Yu H. Deep learning for monocular depth estimation: a review. Neurocomputing. 2021;438:14–33.

3. Yan L, Yu F, Dong C. EMTNet: efficient mobile transformer network for real-time monocular depth estimation. Pattern Anal Appl. 2023;26(4):1833–46.

4. Li S, Shi J, Song W, Hao A, Qin H. Hierarchical object relationship constrained monocular depth estimation. Pattern Recognit. 2021;120: 108116.

5. Garg R, Bg VK, Carneiro G, Reid I. Unsupervised cnn for single view depth estimation: geometry to the rescue. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part VIII 14. Springer; 2016; p. 740–56.