PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud

Published:2020-12-28 Issue:1 Volume:21 Page:136
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Li Fangyu, Jin Weizheng, Fan Cien, Zou Lian, Chen Qingsheng, Li Xiaopeng, Jiang Hao, Liu Yifeng

Abstract

3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although one-stage 3D detectors have satisfactory training and inference speed, their accuracy still suffers from insufficient use of bird’s eye view (BEV) information. In this paper, a new backbone network is proposed that performs cross-layer fusion of multi-scale BEV feature maps, so that information from different scales is used for detection. Specifically, the proposed backbone network is divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serve as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates the different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method performs better in both 3D and BEV object detection than several previous state-of-the-art methods. Experimental results measured by average precision (AP) demonstrate the effectiveness of our network.

Funder

National Key Research and Development Program of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/1/136/pdf

References: 35 articles.

1. Semantic image segmentation with deep convolutional nets and fully connected CRFs;Chen;arXiv;2014

Cited by 16 articles.

1. Cascade fusion of multi-modal and multi-source feature fusion by the attention for three-dimensional object detection;Engineering Applications of Artificial Intelligence;2024-07

2. Research on PointPillars Algorithm Based on Feature-Enhanced Backbone Network;Electronics;2024-03-27

3. MonoGhost: Lightweight Monocular GhostNet 3D Object Properties Estimation for Autonomous Driving;Robotics;2023-11-17

4. Position-Aware Voxel Aggregate Network for Two-Stage 3-D Object Detector;IEEE Sensors Journal;2023-08-15

5. SO-YOLOv5: Small object recognition algorithm for sea cucumber in complex seabed environment;Fisheries Research;2023-08