Abstract
Three-dimensional (3D) object detection is an important research in 3D computer vision with significant applications in many fields, such as automatic driving, robotics, and human–computer interaction. However, the low precision is an urgent problem in the field of 3D object detection. To solve it, we present a framework for 3D object detection in point cloud. To be specific, a designed Backbone Network is used to make fusion of low-level features and high-level features, which makes full use of various information advantages. Moreover, the two-dimensional (2D) Generalized Intersection over Union is extended to 3D use as part of the loss function in our framework. Empirical experiments of Car, Cyclist, and Pedestrian detection have been conducted respectively on the KITTI benchmark. Experimental results with average precision (AP) have shown the effectiveness of the proposed network.
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference40 articles.
1. Binary Volumetric Convolutional Neural Networks for 3-D Object Recognition;Chao;IEEE Trans. Instrum. Meas.,2019
2. Learning Multi-view Representation with LSTM for 3D Shape Recognition and Retrieval;Chao;IEEE Trans. Multimed.,2019
3. Ask Me Anything Dynamic Memory Networks for Natural Language Processing;Ankit;arXiv,2015
4. Deep Convolutional Networks for Natural Language Processing;Alexis;arXiv,2018
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献