Affiliation:
1. Collage of Software, Xinjiang University, Xinjiang Uyghur Autonomous Region, China
2. Land and Property Department, China Railway Urumqi Bureau Group Co., Xinjiang Uyghur Autonomous Region, China
Abstract
In recent years, 3D object detection based on LiDAR point clouds is a key component of autonomous driving. In pursuit of enhancing the accuracy of 3D point cloud feature extraction and point cloud detection, this paper introduces a novel 3D object detection model, termed as Graph Self-Attention-RCNN (GA-RCNN). This model is designed to integrate voxel information and point location information, enhancing the quality of 3D object proposals while maintaining contextual accuracy. The first stage rectifies the previous approach that relied on local features for preselected boxes, overlooking crucial global contextual information. An improved method is suggested in this work, utilizing BEV to capture long-range dependencies via a cross-attention mechanism. The second stage addresses the overreliance on local neighborhood point feature extraction. The Graph Self-Attention Pooling method is proposed, characterized by its dynamic computation of contribution weights for inputs. This enhances the model’s flexibility and generalization performance. Extensive evaluations on KITTI and Waymo datasets demonstrate GA-RCNN’s superior accuracy compared to other methods, affirming its efficacy in 3D object detection.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference14 articles.
1. Drone detection using sparse lidarmeasurements;Dogru;IEEE Robotics and Automation Letters,2022
2. Efficientmspso sampling for object detection and 6-d pose estimation in 3-dscenes;Xing;IEEE Transactions on Industrial Electronics,2021
3. Second: Sparsely embedded convolutionaldetection;Yan;Sensors,2018
4. Pointnet++: Deep hierarchicalfeature learning on point sets in a metric space;Qi;Advances inNeural Information Processing Systems
5. Attention is all you need;Vaswani;Advances in Neural Information Processing Systems