PointPainting: 3D Object Detection Aided by Semantic Image Information-Reference-Cited by-同舟云学术

PointPainting: 3D Object Detection Aided by Semantic Image Information

Published:2023-03-06 Issue:5 Volume:23 Page:2868
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Gao Zhentong¹²³^ORCID,Wang Qiantong¹²^ORCID,Pan Zongxu¹²³^ORCID,Zhai Zhenyu¹²³^ORCID,Long Hui¹²³

Affiliation:

1. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China

2. Key Laboratory of Technology in Geo-Spatial Information Processing and Application System, Chinese Academy of Sciences, Beijing 100190, China

3. School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China

Abstract

A multi-modal 3D object-detection method, based on data from cameras and LiDAR, has become a subject of research interest. PointPainting proposes a method for improving point-cloud-based 3D object detectors using semantic information from RGB images. However, this method still needs to improve on the following two complications: first, there are faulty parts in the image semantic segmentation results, leading to false detections. Second, the commonly used anchor assigner only considers the intersection over union (IoU) between the anchors and ground truth boxes, meaning that some anchors contain few target LiDAR points assigned as positive anchors. In this paper, three improvements are suggested to address these complications. Specifically, a novel weighting strategy is proposed for each anchor in the classification loss. This enables the detector to pay more attention to anchors containing inaccurate semantic information. Then, SegIoU, which incorporates semantic information, instead of IoU, is proposed for the anchor assignment. SegIoU measures the similarity of the semantic information between each anchor and ground truth box, avoiding the defective anchor assignments mentioned above. In addition, a dual-attention module is introduced to enhance the voxelized point cloud. The experiments demonstrate that the proposed modules obtained significant improvements in various methods, consisting of single-stage PointPillars, two-stage SECOND-IoU, anchor-base SECOND, and an anchor-free CenterPoint on the KITTI dataset.

Funder

Youth Innovation Promotion Association, CAS

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/5/2868/pdf

Reference40 articles.

1. Yoo, J.H., Kim, Y., Kim, J., and Choi, J.W. (2020, January 23–28). 3d-cvf: Generating joint camera and LiDAR features using cross-view spatial feature fusion for 3d object detection. Proceedings of the European Conference on Computer Vision, Glasgow, UK.

2. Xie, L., Xiang, C., Yu, Z., Xu, G., Yang, Z., Cai, D., and He, X. (2020, January 7–12). PI-RCNN: An efficient multi-sensor 3D object detector with point-based attentive cont-conv fusion module. Proceedings of the AAAI conference on Artificial Intelligence, New York, NY, USA.

3. Huang, T., Liu, Z., Chen, X., and Bai, X. (2020, January 23–28). Epnet: Enhancing point features with image semantics for 3d object detection. Proceedings of the European Conference on Computer Vision, Glasgow, UK.

4. Pang, S., Morris, D., and Radha, H. (2020, January 24–29). CLOCs: Camera-LiDAR object candidates fusion for 3D object detection. Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.

5. Zheng, W., Tang, W., Jiang, L., and Fu, C.W. (2021, January 20–25). SE-SSD: Self-ensembling single-stage object detector from point cloud. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. 3D Object Detection Using Semantic Maps;2024 5th International Conference for Emerging Technology (INCET);2024-05-24