Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover-Reference-Cited by-同舟云学术

Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover

Published:2021-08-23 Issue:16 Volume:21 Page:5676
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zhang Yan^ORCID,Müller Steffen,Stephan Benedict^ORCID,Gross Horst-Michael,Notni Gunther

Abstract

This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: 82.8% by RandLA-Net) compared with the other three modes (77.3% by XYZ-RGB, 35.7% by XYZ-T, 35.7% by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves 92.6%.

Funder

Freistaat Thüringen aus Mitteln des Europäischen Sozialfonds

Thüringer Aufbaubank

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/16/5676/pdf

Reference36 articles.

1. Multi-modal rgb–depth–thermal human body segmentation;Palmero;Int. J. Comput. Vis.,2016

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. ThermalNeRF: Thermal Radiance Fields;2024 IEEE International Conference on Computational Photography (ICCP);2024-07-22

2. Multimodal 3D measurement setup for generating multimodal real-world data sets for AI-based transparent object recognition;Dimensional Optical Metrology and Inspection for Practical Applications XIII;2024-06-07

3. Morphological estimation of primary branch length of individual apple trees during the deciduous period in modern orchard based on PointNet++;Computers and Electronics in Agriculture;2024-05

4. Fusion of Multimodal Imaging and 3D Digitization Using Photogrammetry;Sensors;2024-04-03

5. Data Fusion of RGB and Depth Data with Image Enhancement;Journal of Imaging;2024-03-21