You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding

Published:2024-01 Volume:02
ISSN:2811-0323
Container-title:World Scientific Annual Review of Artificial Intelligence
language:en
Short-container-title:World Sci. Ann. Rev. Artif. Intell.

Author:

Liu Zhengzhe¹, Qi Xiaojuan², Fu Chi-Wing¹

Affiliation:

1. Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong

2. Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong

Abstract

3D scene understanding tasks, such as semantic and instance segmentation of point clouds, typically demand extensively annotated datasets, yet producing point-by-point labels is highly laborious. Recent techniques train 3D networks with only a small fraction of labeled points; our method, dubbed “One Thing One Click,” goes further by requiring just a single label per object. To use these sparse annotations effectively during network training, we design a self-training strategy that alternates between network training and label propagation, the latter powered by a graph propagation module. In addition, we adopt a relation network to generate category-specific prototypes, which improve pseudo-label accuracy and guide the training process. The approach also extends to 3D instance segmentation by incorporating a point-clustering technique. In tests on the ScanNet-v2 and S3DIS datasets, our method outperforms other weakly supervised approaches for 3D semantic and instance segmentation. Notably, with these limited annotations, our self-training approach achieves results comparable to those of fully supervised models. Code and models are available at https://github.com/liuzhengzhe/One-Thing-One-Click .

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S2811032324400058

Reference: 72 articles.

1. J. Wei, G. Lin, K. H. Yap, T. Y. Hung and L. Xie, Proc IEEE/CVF Conf Computer Vision and Pattern Recognition (CVPR), IEEE, 2020, pp. 4384–4393.

2. X. Xu and G. H. Lee, Proc IEEE/CVF Conf Computer Vision and Pattern Recognition (CVPR), IEEE, 2020, pp. 13706–13715.

3. M. Li, Y. Xie, Y. Shen, B. Ke, R. Qiao, B. Ren, S. Lin and L. Ma, Proc IEEE/CVF Conf Computer Vision and Pattern Recognition (CVPR), IEEE, 2022, pp. 14930–14939.

4. SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds

Cited by 1 article.

1. A review of point cloud segmentation for understanding 3D indoor scenes; Visual Intelligence; 2024-06-07