GA-RCNN:Graph self-attention feature extraction for 3D object detection-Reference-Cited by-同舟云学术

GA-RCNN:Graph self-attention feature extraction for 3D object detection

Published:2024-01-08 Issue: Volume: Page:1-15
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Yi Yangyang¹,Yu Long¹,Tian Shengwei¹,Gao Xuezhuang¹,Li Jie¹,Zhao Xingang²

Affiliation:

1. Collage of Software, Xinjiang University, Xinjiang Uyghur Autonomous Region, China

2. Land and Property Department, China Railway Urumqi Bureau Group Co., Xinjiang Uyghur Autonomous Region, China

Abstract

In recent years, 3D object detection based on LiDAR point clouds is a key component of autonomous driving. In pursuit of enhancing the accuracy of 3D point cloud feature extraction and point cloud detection, this paper introduces a novel 3D object detection model, termed as Graph Self-Attention-RCNN (GA-RCNN). This model is designed to integrate voxel information and point location information, enhancing the quality of 3D object proposals while maintaining contextual accuracy. The first stage rectifies the previous approach that relied on local features for preselected boxes, overlooking crucial global contextual information. An improved method is suggested in this work, utilizing BEV to capture long-range dependencies via a cross-attention mechanism. The second stage addresses the overreliance on local neighborhood point feature extraction. The Graph Self-Attention Pooling method is proposed, characterized by its dynamic computation of contribution weights for inputs. This enhances the model’s flexibility and generalization performance. Extensive evaluations on KITTI and Waymo datasets demonstrate GA-RCNN’s superior accuracy compared to other methods, affirming its efficacy in 3D object detection.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference14 articles.

1. Drone detection using sparse lidarmeasurements;Dogru;IEEE Robotics and Automation Letters,2022

2. Efficientmspso sampling for object detection and 6-d pose estimation in 3-dscenes;Xing;IEEE Transactions on Industrial Electronics,2021

3. Second: Sparsely embedded convolutionaldetection;Yan;Sensors,2018

4. Pointnet++: Deep hierarchicalfeature learning on point sets in a metric space;Qi;Advances inNeural Information Processing Systems

5. Attention is all you need;Vaswani;Advances in Neural Information Processing Systems