Multi-feature Fusion VoteNet for 3D Object Detection-Reference-Cited by-同舟云学术

Multi-feature Fusion VoteNet for 3D Object Detection

Published:2022-01-27 Issue:1 Volume:18 Page:1-17
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Wang Zhoutao¹,Xie Qian¹,Wei Mingqiang¹,Long Kun¹,Wang Jun¹

Affiliation:

1. Nanjing University of Aeronautics and Astronautics, Nanjing, Jiangsu, China

Abstract

In this article, we propose a Multi-feature Fusion VoteNet (MFFVoteNet) framework for improving the 3D object detection performance in cluttered and heavily occluded scenes. Our method takes the point cloud and the synchronized RGB image as inputs to provide object detection results in 3D space. Our detection architecture is built on VoteNet with three key designs. First, we augment the VoteNet input with point color information to enhance the difference of various instances in a scene. Next, we integrate an image feature module into the VoteNet to provide a strong object class signal that can facilitate deterministic detections in occlusion. Moreover, we propose a Projection Non-Maximum Suppression (PNMS) method in 3D object detection to eliminate redundant proposals and hence provide more accurate positioning of 3D objects. We evaluate the proposed MFFVoteNet on two challenging 3D object detection datasets, i.e., ScanNetv2 and SUN RGB-D. Extensive experiments show that our framework can effectively improve the performance of 3D object detection.

Funder

National Key Research and Development Program of China

Aeronautical Science Foundation of China

National Natural Science Foundation of China

Natural Science Foundation of Jiangsu Province

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3462219

Reference55 articles.