Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images-Reference-Cited by-同舟云学术

Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images

Published:2021-01-05 Issue:1 Volume:53 Page:653-670
ISSN:1370-4621
Container-title:Neural Processing Letters
language:en
Short-container-title:Neural Process Lett

Author:

Gao Zishu,Li En,Wang Zhe,Yang Guodong,Lu Jiwu,Ouyang Bo,Xu Dawei,Liang Zize

Abstract

AbstractThe application of traditional 3D reconstruction methods such as structure-from-motion and simultaneous localization and mapping are typically limited by illumination conditions, surface textures, and wide baseline viewpoints in the field of robotics. To solve this problem, many researchers have applied learning-based methods with convolutional neural network architectures. However, simply utilizing convolutional neural networks without taking other measures into account is computationally intensive, and the results are not satisfying. In this study, to obtain the most informative images for reconstruction, we introduce a residual block to a 2D encoder for improved feature extraction, and propose an attentive latent unit that makes it possible to select the most informative image being fed into the network rather than choosing one at random. The recurrent visual attentive network is injected into the auto-encoder network using reinforcement learning. The recurrent visual attentive network pays more attention to useful images, and the agent will quickly predict the 3D volume. This model is evaluated based on both single- and multi-view reconstructions. The experiment results show that the recurrent visual attentive network increases prediction performance in a way that is superior to other alternative methods, and our model has desirable capacity for generalization.

Funder

National Natural Science Foundation of China

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Networks and Communications,General Neuroscience,Software

Link

http://link.springer.com/content/pdf/10.1007/s11063-020-10399-1.pdf

Reference35 articles.

1. Li C, Lu B, Zhang Y et al (2018) 3d reconstruction of indoor scenes via image registration. Neural Process Lett 48(3):1281–1304

2. Orts-Escolano S, Garcia-Rodriguez J, Morell V et al (2016) 3d surface reconstruction of noisy point clouds using growing neural gas: 3d object/scene reconstruction. Neural Process Lett 43(2):401–423

3. Snavely N, Seitz SM, Szeliski R (2006) Photo tourism: exploring photo collections in 3D. In: ACM Siggraph 2006 Papers, pp 835–846