Robust Classification and 6D Pose Estimation by Sensor Dual Fusion of Image and Point Cloud Data-Reference-Cited by-同舟云学术

Robust Classification and 6D Pose Estimation by Sensor Dual Fusion of Image and Point Cloud Data

Published:2024-02-16 Issue:2 Volume:20 Page:1-21
ISSN:1550-4859
Container-title:ACM Transactions on Sensor Networks
language:en
Short-container-title:ACM Trans. Sen. Netw.

Author:

Xu Yaming¹^ORCID,Wang Yan¹^ORCID,Li Boliang¹^ORCID

Affiliation:

1. Harbin Institute of Technolog (School of Astronautics), China

Abstract

It is an important aspect to fully leverage complementary sensors of images and point clouds for objects classification and six-dimensional (6D) pose estimation tasks. Prior works extract objects category from a single sensor such as RGB camera or LiDAR, limiting their robustness in the event that a key sensor is severely blocked or fails. In this work, we present a robust objects classification and 6D object pose estimation strategy by dual fusion of image and point cloud data. Instead of solely relying on 3D proposals or mature 2D object detectors, our model deeply integrates 2D and 3D information of heterogeneous data sources by a robustness dual fusion network and an attention-based nonlinear fusion function Attn-fun(.), achieving efficiency as well as high accuracy classification for even missed some data sources. Then, our method is also able to precisely estimate the transformation matrix between two input objects by minimizing the feature difference to achieve 6D object pose estimation, even under strong noise or with outliers. We deploy our proposed method not only to ModelNet40 datasets but also to a real fusion vision rotating platform for tracking objects in outer space based on the estimated pose.

Funder

China Academy of Space Technology

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3639705

Reference65 articles.

1. SalsaNet: Fast Road and Vehicle Segmentation in LiDAR Point Clouds for Autonomous Driving

2. PointNetLK: Robust & Efficient Point Cloud Registration Using PointNet

3. Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

4. Multimodal vehicle detection: fusing 3D-LIDAR and color camera data

5. Deep Learning for Image and Point Cloud Fusion in Autonomous Driving: A Review