Analysis of the PointNet neural network architecture

Author:

Shchenyavskaya L. A. (1), Gura D. A. (2), Dyachenko R. A. (1)

Affiliation:

1. Kuban State Technological University

2. Kuban State Technological University; Kuban State Agrarian University

Abstract

Objective. Most researchers convert point cloud data into regular three-dimensional voxel grids or collections of images, which makes the data unnecessarily voluminous and complicates processing. The purpose of the study is to analyze the architecture of the PointNet neural network. Method. A unified approach was applied to a range of 3D recognition problems, from object classification and part segmentation to semantic scene analysis. Result. A comparative analysis of 2D and 3D object classification was carried out, and the layers and functions through which classification occurs were studied in detail. A type of neural network is considered that consumes point clouds directly and respects the permutation invariance of the points in the input data. The network is shown to provide a unified architecture for applications ranging from object classification and part segmentation to semantic scene parsing. For semantic segmentation, the input can be either a single object (for part segmentation) or a small region of a 3D scene. The deep architecture for point clouds considered here is called PointNet. Conclusion. The deep point cloud architecture PointNet is presented. For the object classification task, the input point cloud is either sampled directly from a shape or pre-segmented from a scene point cloud. To obtain a virtual model of the real world, neural network solutions are used that assume the availability of an RGB point cloud captured by an RGB-D camera from one or several viewpoints.
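
The permutation invariance mentioned in the abstract can be illustrated with a toy example. The sketch below is not the PointNet implementation and is not taken from the article: it assumes a single shared linear layer with ReLU applied to every point, followed by channel-wise max pooling, which is the kind of symmetric aggregation PointNet relies on. The names shared_mlp, global_feature and FEATURES, the random weights, and the 16-point cloud are all hypothetical and chosen only for illustration.

```python
import random

def shared_mlp(point, weights, biases):
    """Apply the same linear layer + ReLU to one (x, y, z) point."""
    return [
        max(0.0, sum(w * c for w, c in zip(row, point)) + b)
        for row, b in zip(weights, biases)
    ]

def global_feature(points, weights, biases):
    """Max-pool the per-point features channel by channel.

    Max is a symmetric function, so the result does not depend
    on the order in which the points are listed.
    """
    per_point = [shared_mlp(p, weights, biases) for p in points]
    return [max(channel) for channel in zip(*per_point)]

rng = random.Random(0)  # fixed seed so the toy weights are reproducible
FEATURES = 8            # hypothetical feature width, for illustration only
weights = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(FEATURES)]
biases = [rng.uniform(-1, 1) for _ in range(FEATURES)]

# A toy "point cloud" of 16 random points and a shuffled copy of it.
cloud = [(rng.random(), rng.random(), rng.random()) for _ in range(16)]
shuffled = cloud[:]
rng.shuffle(shuffled)

# The global feature is identical for both orderings of the same points.
same = global_feature(cloud, weights, biases) == global_feature(shuffled, weights, biases)
print(same)
```

Because every point passes through the same weights and the only cross-point operation is a maximum, reordering the input cannot change the output. The full network stacks several such shared layers and adds further components that this sketch omits.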

Publisher

FSB Educational Establishment of Higher Education Daghestan State Technical University

Subject

Polymers and Plastics, General Environmental Science

