A Two-Phase Cross-Modality Fusion Network for Robust 3D Object Detection

Published: 2020-10-23 Issue: 21 Volume: 20 Page: 6043
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Jiao Yujun, Yin Zhishuai

Abstract

This study proposes a two-phase cross-modality fusion detector for robust, high-precision 3D object detection from RGB images and LiDAR point clouds. First, a two-stream fusion network is built into the Faster R-CNN framework to perform accurate and robust 2D detection. The visible stream takes RGB images as input, while the intensity stream takes intensity maps generated by projecting the reflection intensity of the point cloud onto the front view. A multi-layer feature-level fusion scheme merges the multi-modal features across multiple layers, enhancing the expressiveness and robustness of the features from which region proposals are generated. Second, a decision-level fusion projects the 2D proposals into the point-cloud space to generate 3D frustums, on which the second-phase 3D detector performs instance segmentation and 3D-box regression over the filtered point cloud. Results on the KITTI benchmark show that features extracted from RGB images and intensity maps complement each other, and that the proposed detector achieves state-of-the-art 3D object detection performance with a substantially lower running time than competing methods.
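
The two projection steps named in the abstract (rasterizing LiDAR reflectance into a front-view intensity map, and selecting the points that fall inside the frustum of a 2D proposal) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the points are already expressed in the camera frame as `(x, y, z, intensity)`, that `P` is a 3x4 camera projection matrix, and that the nearest point wins when several land on the same pixel. The function names `project_to_image`, `intensity_map`, and `frustum_points` are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple

# Assumed layout: (x, y, z) in the camera frame plus reflectance intensity.
Point = Tuple[float, float, float, float]
Matrix = Sequence[Sequence[float]]  # 3x4 projection matrix (assumption)


def project_to_image(p: Point, P: Matrix) -> Optional[Tuple[float, float, float]]:
    """Return (u, v, w) pixel coordinates and homogeneous depth, or None if behind the camera."""
    x, y, z = p[0], p[1], p[2]
    u = P[0][0] * x + P[0][1] * y + P[0][2] * z + P[0][3]
    v = P[1][0] * x + P[1][1] * y + P[1][2] * z + P[1][3]
    w = P[2][0] * x + P[2][1] * y + P[2][2] * z + P[2][3]
    if w <= 0.0:
        return None
    return u / w, v / w, w


def intensity_map(points: Sequence[Point], P: Matrix,
                  width: int, height: int) -> List[List[float]]:
    """Rasterize reflectance into a height x width grid; empty pixels stay 0.0."""
    image = [[0.0] * width for _ in range(height)]
    depth = [[math.inf] * width for _ in range(height)]
    for p in points:
        proj = project_to_image(p, P)
        if proj is None:
            continue
        u, v, w = proj
        col, row = math.floor(u), math.floor(v)
        # Keep only the closest point per pixel (a simple z-buffer).
        if 0 <= col < width and 0 <= row < height and w < depth[row][col]:
            depth[row][col] = w
            image[row][col] = p[3]
    return image


def frustum_points(points: Sequence[Point], P: Matrix,
                   box: Tuple[float, float, float, float]) -> List[Point]:
    """Keep points whose projection lies inside the 2D box (u_min, v_min, u_max, v_max)."""
    u_min, v_min, u_max, v_max = box
    kept = []
    for p in points:
        proj = project_to_image(p, P)
        if proj is None:
            continue
        u, v, _ = proj
        if u_min <= u <= u_max and v_min <= v <= v_max:
            kept.append(p)
    return kept
```

In practice a dataset such as KITTI also requires a LiDAR-to-camera calibration transform before this projection, and a 3D detector would then operate on the points returned by `frustum_points`; both are outside the scope of this sketch.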

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/21/6043/pdf

References: 58 articles.

1. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

2. Deep Nets: What have they ever done for Vision?; Yuille; arXiv, 2018

3. 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection; Yoo; arXiv, 2020

Cited by 3 articles.

1. A Comprehensive Review: 3d Object Detection Based on Visible Light Camera, Infrared Camera, and Lidar in Dark Scene; 2024

2. Performance and Challenges of 3D Object Detection Methods in Complex Scenes for Autonomous Driving; IEEE Transactions on Intelligent Vehicles; 2023-02

3. Classification of Tree Species and Standing Dead Trees with Lidar Point Clouds Using Two Deep Neural Networks: PointCNN and 3DmFV-Net; PFG – Journal of Photogrammetry, Remote Sensing and Geoinformation Science; 2022-03-30