FusionPillars: A 3D Object Detection Network with Cross-Fusion and Self-Fusion-Reference-Cited by-同舟云学术

FusionPillars: A 3D Object Detection Network with Cross-Fusion and Self-Fusion

Published:2023-05-22 Issue:10 Volume:15 Page:2692
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Zhang Jing¹²³^ORCID,Xu Da¹²,Li Yunsong¹²,Zhao Liping⁴,Su Rui⁵

Affiliation:

1. State Key Laboratory of Integrated Service Network, Xidian University, Xi’an 710071, China

2. School of Telecommunication Engineering, Xidian University, Xi’an 710071, China

3. Guangzhou Institute of Technology, Xidian University, Guangzhou 510555, China

4. National Defense Science and Technology Innovation Research Institute, Beijing 100071, China

5. Xi’an Termony Electronic Technology Co., Ltd., Xi’an 710031, China

Abstract

In the field of unmanned systems, cameras and LiDAR are important sensors that provide complementary information. However, the question of how to effectively fuse data from two different modalities has always been a great challenge. In this paper, inspired by the idea of deep fusion, we propose a one-stage end-to-end network named FusionPillars to fuse multisensor data (namely LiDAR point cloud and camera images). It includes three branches: a point-based branch, a voxel-based branch, and an image-based branch. We design two modules to enhance the voxel-wise features in the pseudo-image: the Set Abstraction Self (SAS) fusion module and the Pseudo View Cross (PVC) fusion module. For the data from a single sensor, by considering the relationship between the point-wise and voxel-wise features, the SAS fusion module self-fuses the point-based branch and the voxel-based branch to enhance the spatial information of the pseudo-image. For the data from two sensors, through the transformation of the images’ view, the PVC fusion module introduces the RGB information as auxiliary information and cross-fuses the pseudo-image and RGB image of different scales to supplement the color information of the pseudo-image. Experimental results revealed that, compared to existing current one-stage fusion networks, FusionPillars yield superior performance, with a considerable improvement in the detection precision for small objects.

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/10/2692/pdf

Reference46 articles.

1. Qi, C.R., Su, H., Mo, K., and Guibas, L.J. (2017, January 21–26). Pointnet: Deep learning on point sets for 3d classification and segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.

2. Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (2017). Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv. Neural Inf. Process. Syst., 30.

3. Dynamic graph cnn for learning on point clouds;Wang;Acm Trans. Graph. (Tog),2019

4. Wang, Y., Chao, W.L., Garg, D., Hariharan, B., Campbell, M., and Weinberger, K.Q. (2019, January 15–20). Pseudo-lidar from visual depth estimation: Bridging the gap in 3d object detection for autonomous driving. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.

5. Engelcke, M., Rao, D., Wang, D.Z., Tong, C.H., and Posner, I. (June, January 29). Vote3deep: Fast object detection in 3d point clouds using efficient convolutional neural networks. Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), Singapore.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Equal Emphasis on Data and Network: A Two-Stage 3D Point Cloud Object Detection Algorithm with Feature Alignment;Remote Sensing;2024-01-08

2. DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds;Remote Sensing;2023-12-03

3. TranSDet: Toward Effective Transfer Learning for Small-Object Detection;Remote Sensing;2023-07-12