Analysis of the PointNet neural network architecture-Reference-Cited by-同舟云学术

Analysis of the PointNet neural network architecture

Published:2024-01-23 Issue:4 Volume:50 Page:158-165
ISSN:2542-095X
Container-title:Herald of Dagestan State Technical University. Technical Sciences
language:
Short-container-title:Vestn. Dagest. gos. teh. univ., Teh. nauki

Author:

Shchenyavskaya L. A.¹,Gura D. A.²,Dyachenko R. A.¹

Affiliation:

1. Kuban State Technological University

2. Kuban State Technological University; Kuban State Agrarian University

Abstract

Objective. Most researchers convert point cloud data into ordinary three-dimensional voxel grids or image collections, which makes the data unnecessarily voluminous and causes problems when processing them. The purpose of the study is to analyze the architecture of the PointNet neural network. Method. A unified approach has been applied to solving various 3D recognition problems, ranging from object classification, detail segmentation to semantic scene analysis. Result. A comparative analysis of the classification of 2d and 3d objects was carried out, the layers and functions through which classification occurs were studied in detail. A type of neural network is considered that directly uses point clouds, which takes into account the invariance of permutations of points in the input data. The network is determined to provide a unified architecture for applications ranging from object classification, part segmentation, and scene semantics. For semantic segmentation, the input data can be either a single object from the part area segmentation or a small part of the 3D scene. A neural network that is widely used for raster image editing, graphic design, and digital art is a deep point cloud architecture called PointNet. Conclusion. A new deep point cloud architecture, PointNet, is introduced. For object classification task, the input point cloud is directly selected from the shape or pre-segmented from the scene point cloud. To obtain a virtual model of the real world, neural network solutions are used, based on the assumption that there is an RGB point cloud obtained by an RGB-D camera from one or several angles.

Publisher

FSB Educational Establishment of Higher Education Daghestan State Technical University

Subject

Polymers and Plastics,General Environmental Science

Reference20 articles.

1. Mozhaev A. N. Segmentation of point clouds by means of the point cloud library. Extreme robotics. 2018; 1(1):301-308. – EDN YNCUTJ (In Russ)

2. Zhu X. X., Tuia D., Mou L., Xia G. S., Zhang L., Xu F., Fraundorfer F. Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geoscience and Remote Sensing Magazine. 2017; 8–36. DOI: 10.1109/MGRS.2017.2762307

3. Ferrit S. E. Patterns and practice of deep learning / translated from English by A.V. Logunov. – M.: DMK Press, 2022; 538. ISBN 978-5-93700-113-9(In Russ)

4. Aliyev R.M., Morozova O.N. “Comparative analysis of the application of point cloud processing methods Paint Net and Paint Net++ for the task of segmentation of 3D objects”. Proceedings of the Institute of System Programming of the Russian Academy of Sciences. 2020; 29(1):37-54. (In Russ)

5. Melnik S.P., Ivanov I.V. Analysis of the PointNet method for the task of segmentation of three-dimensional objects”. Information technologies and computer engineering. 2018; 16( 5): 951-960. (In Russ)