Abstract
It is a challenging problem to infer objects with reasonable shapes and appearance from a single picture. Existing research often pays more attention to the structure of the point cloud generation network, while ignoring the feature extraction of 2D images and reducing the loss in the process of feature propagation in the network. In this paper, a single-stage and single-view 3D point cloud reconstruction network, 3D-SSRecNet, is proposed. The proposed 3D-SSRecNet is a simple single-stage network composed of a 2D image feature extraction network and a point cloud prediction network. The single-stage network structure can reduce the loss of the extracted 2D image features. The 2D image feature extraction network takes DetNet as the backbone. DetNet can extract more details from 2D images. In order to generate point clouds with better shape and appearance, in the point cloud prediction network, the exponential linear unit (ELU) is used as the activation function, and the joint function of chamfer distance (CD) and Earth mover’s distance (EMD) is used as the loss function of 3DSSRecNet. In order to verify the effectiveness of 3D-SSRecNet, we conducted a series of experiments on ShapeNet and Pix3D datasets. The experimental results measured by CD and EMD have shown that 3D-SSRecNet outperforms the state-of-the-art reconstruction methods.
Funder
Science and Technology Development Plan Project of Jilin Province
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference29 articles.
1. Point Cloud Interaction and Manipulation in Virtual Reality;Garrido;Proceedings of the 2021 5th International Conference on Artificial Intelligence and Virtual Reality (AIVR),2021
2. Predicting 3D shapes, masks, and properties of materials inside transparent containers, using the TransProteus CGI dataset
3. TransLoc3D: Point Cloud based Large-scale Place Recognition using Adaptive Receptive Fields;Xu;arXiv,2021
4. A Point Set Generation Network for 3D Object Reconstruction from a Single Image
5. 3D-LMNet: Latent embedding matching for accurate and diverse 3D point cloud reconstruction from a single image;Mandikal;arXiv,2018
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献