Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper

Author:

Vinodkumar Prasoon Kumar1,Karabulut Dogus1,Avots Egils1,Ozcinar Cagri1,Anbarjafari Gholamreza1234ORCID

Affiliation:

1. iCV Lab, Institute of Technology, University of Tartu, 50090 Tartu, Estonia

2. PwC Advisory, 00180 Helsinki, Finland

3. iVCV OÜ, 51011 Tartu, Estonia

4. Institute of Higher Education, Yildiz Technical University, Beşiktaş, Istanbul 34349, Turkey

Abstract

The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future.

Funder

SilentBorder

Publisher

MDPI AG

Reference217 articles.

1. Vinodkumar, P.K., Karabulut, D., Avots, E., Ozcinar, C., and Anbarjafari, G. (2023). A Survey on Deep Learning Based Segmentation, Detection and Classification for 3D Point Clouds. Entropy, 25.

2. Behley, J., Garbade, M., Milioto, A., Quenzel, J., Behnke, S., Stachniss, C., and Gall, J. (November, January 27). Semantickitti: A dataset for semantic scene understanding of lidar sequences. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea.

3. Armeni, I., Sener, O., Zamir, A.R., Jiang, H., Brilakis, I., Fischer, M., and Savarese, S. (July, January 26). 3d semantic parsing of large-scale indoor spaces. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

4. Qi, C.R., Chen, X., Litany, O., and Guibas, L.J. (2020). ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes. arXiv.

5. Zhou, Y., and Tuzel, O. (2018, January 18–23). Voxelnet: End-to-end learning for point cloud based 3d object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3