AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion-Reference-Cited by-同舟云学术

AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion

Published:2023-11-22 Issue:23 Volume:23 Page:9319
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Li Guangping¹,Mo Zuanfang¹,Ling Bingo Wing-Kuen¹^ORCID

Affiliation:

1. School of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China

Abstract

With the advent of autonomous vehicle applications, the importance of LiDAR point cloud 3D object detection cannot be overstated. Recent studies have demonstrated that methods for aggregating features from voxels can accurately and efficiently detect objects in large, complex 3D detection scenes. Nevertheless, most of these methods do not filter background points well and have inferior detection performance for small objects. To ameliorate this issue, this paper proposes an Attention-based and Multiscale Feature Fusion Network (AMFF-Net), which utilizes a Dual-Attention Voxel Feature Extractor (DA-VFE) and a Multi-scale Feature Fusion (MFF) Module to improve the precision and efficiency of 3D object detection. The DA-VFE considers pointwise and channelwise attention and integrates them into the Voxel Feature Extractor (VFE) to enhance key point cloud information in voxels and refine more-representative voxel features. The MFF Module consists of self-calibrated convolutions, a residual structure, and a coordinate attention mechanism, which acts as a 2D Backbone to expand the receptive domain and capture more contextual information, thus better capturing small object locations, enhancing the feature-extraction capability of the network and reducing the computational overhead. We performed evaluations of the proposed model on the nuScenes dataset with a large number of driving scenarios. The experimental results showed that the AMFF-Net achieved 62.8% in the mAP, which significantly boosted the performance of small object detection compared to the baseline network and significantly reduced the computational overhead, while the inference speed remained essentially the same. AMFF-Net also achieved advanced performance on the KITTI dataset.

Funder

National Natural Science Foundation of China

Science and Technology Program of Daya Bay

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/23/9319/pdf

Reference47 articles.

1. Robust target recognition and tracking of self-driving cars with radar and camera information fusion under severe weather conditions;Liu;IEEE Trans. Intell. Transp. Syst.,2021

2. A survey on 3D object detection methods for autonomous driving applications;Arnold;IEEE Trans. Intell. Transp. Syst.,2019

3. Deep 3D object detection networks using LiDAR data: A review;Wu;IEEE Sens. J.,2020

4. Deep learning for 3D point clouds: A survey;Guo;IEEE Trans. Pattern Anal. Mach. Intell.,2020

5. Qi, C.R., Yi, L., Su, H., and Guibas, L.J. (2017). Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv. Neural Inf. Process. Syst., 30.