Abstract
Abstract
Purpose and Methods: Currently, most network models for point cloud segmentation face challenges such as inadequate capture of continuous structural information on the point cloud surface and low feature recognition rates. This paper first investigates the local surface structure of point clouds and introduces the Local Surface Distribution (LSD) structure for analyzing and learning point clouds. This structure calculates the distribution of local neighborhoods in a per-point manner, enhancing the continuity of structural information within the point cloud. Then we propose X-Net, a model structure based on continuous encoding and decoding. Based on LSD and X-Net, we present the LSDX model framework for point cloud segmentation. Furthermore, we specify two implementations of LSDX, namely LSDX-PN2 and LSDX-PT.
Results and Conclusion: We evaluate our proposed LSDX on several challenging benchmarks, and experimental results demonstrate that LSDX produces more accurate segmentation boundary and significantly reduces isolated points in the prediction results than other methods. In particular, LSDX-PT, which takes Point Transformer as the baseline, outperforms the latest model Geospark and PointMetaBase on ScanNetv2 and S3DIS benchmarks, achieving state-of-the-art performance.
Publisher
Research Square Platform LLC
Reference74 articles.
1. Armeni, Iro and Sener, Ozan and Zamir, Amir R and Jiang, Helen and Brilakis, Ioannis and Fischer, Martin and Savarese, Silvio (2016) 3d semantic parsing of large-scale indoor spaces. 1534--1543, Proceedings of the IEEE conference on computer vision and pattern recognition
2. Atzmon, Matan and Maron, Haggai and Lipman, Yaron (2018) Point Convolutional Neural Networks by Extension Operators. ACM Transactions on Graphics 37(4)
3. Bouvrie, Jake (2006) Notes on convolutional neural networks.
4. Chang, Angel X and Funkhouser, Thomas and Guibas, Leonidas and Hanrahan, Pat and Huang, Qixing and Li, Zimo and Savarese, Silvio and Savva, Manolis and Song, Shuran and Su, Hao and others (2015) Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012
5. Chen, Lifang and Zhang, Qian (2023) DDGCN: graph convolution network based on direction and distance for point cloud learning. The Visual Computer 39(3): 863--873 Springer