Affiliation:
1. Department of Computer Science, COMSATS University Islamabad, Lahore Campus, Lahore 54000, Pakistan
2. National Centre for Computer Animation, Bournemouth University, Poole BH12 5BB, Dorset, UK
Abstract
Humans inherently perceive 3D scenes using prior knowledge and visual perception, but 3D reconstruction in computer graphics is challenging due to complex object geometries, noisy backgrounds, and occlusions, leading to high time and space complexity. To addresses these challenges, this study introduces 3DRecNet, a compact 3D reconstruction architecture optimized for both efficiency and accuracy through five key modules. The first module, the Human-Inspired Memory Network (HIMNet), is designed for initial point cloud estimation, assisting in identifying and localizing objects in occluded and complex regions while preserving critical spatial information. Next, separate image and 3D encoders perform feature extraction from input images and initial point clouds. These features are combined using a dual attention-based feature fusion module, which emphasizes features from the image branch over those from the 3D encoding branch. This approach ensures independence from proposals at inference time and filters out irrelevant information, leading to more accurate and detailed reconstructions. Finally, a Decoder Branch transforms the fused features into a 3D representation. The integration of attention-based fusion with the memory network in 3DRecNet significantly enhances the overall reconstruction process. Experimental results on the benchmark datasets, such as ShapeNet, ObjectNet3D, and Pix3D, demonstrate that 3DRecNet outperforms existing methods.
Reference37 articles.
1. Image-based 3D object reconstruction: State-of-the-art and trends in the deep learning era;Han;IEEE Trans. Pattern Anal. Mach. Intell.,2019
2. Sra, M., Garrido-Jurado, S., Schmandt, C., and Maes, P. (2016, January 2–4). Procedurally generated virtual reality from 3D reconstructed physical space. Proceedings of the 22nd ACM Conference on Virtual Reality Software and Technology, Munich, Germany.
3. A fast compressed sensing approach to 3D MR image reconstruction;Montefusco;IEEE Trans. Med. Imaging,2010
4. 3D building reconstruction from single street view images using deep learning;Pang;Int. J. Appl. Earth Obs. Geoinf.,2022
5. Yang, S., Xu, M., Xie, H., Perry, S., and Xia, J. (2021, January 20–25). Single-view 3D object reconstruction from shape priors in memory. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.