Author:
Shao Yongxin,Sun Zhetao,Tan Aihong,Yan Tianhong
Abstract
Lidar-based 3D object detection and classification is a critical task for autonomous driving. However, inferencing from exceedingly sparse 3D data in real-time is a formidable challenge. Complex-YOLO solves the problem of point cloud disorder and sparsity by projecting it onto the bird’s-eye view and realizes real-time 3D object detection based on LiDAR. However, Complex-YOLO has no object height detection, a shallow network depth, and poor small-size object detection accuracy. To address these issues, this paper has made the following improvements: (1) adds a multi-scale feature fusion network to improve the algorithm’s capability to detect small-size objects; (2) uses a more advanced RepVGG as the backbone network to improve network depth and overall detection performance; and (3) adds an effective height detector to the network to improve the height detection. Through experiments, we found that our algorithm’s accuracy achieved good performance on the KITTI dataset, while the detection speed and memory usage were very superior, 48FPS on RTX3070Ti and 20FPS on GTX1060, with a memory usage of 841Mib.
Funder
Natural Science Foundation of Zhejiang Province
Subject
Artificial Intelligence,Biomedical Engineering
Reference41 articles.
1. A detection method of the rescue targets in the marine casualty based on improved YOLOv5s.;Bai;Front. Neurorobot.,2022
2. Yolov4: Optimal speed and accuracy of object detection.;Bochkovskiy;arXiv,2020
3. Multi-view 3d object detection network for autonomous driving;Chen;Proceedings of the IEEE conference on computer vision and pattern recognition,2017
4. Invariance of object detection in untrained deep neural networks.;Cheon;Front. Comput. Neurosci.,2022
5. Voxel r-cnn: Towards high performance voxel-based 3d object detection.;Deng;Proc. AAAI Conf. Artif. Intell.,2021
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献