On Robustness of Multi-Modal Fusion—Robotics Perspective-Reference-Cited by-同舟云学术

On Robustness of Multi-Modal Fusion—Robotics Perspective

Published:2020-07-16 Issue:7 Volume:9 Page:1152
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Bednarek Michal^ORCID,Kicki Piotr,Walas Krzysztof

Abstract

The efficient multi-modal fusion of data streams from different sensors is a crucial ability that a robotic perception system should exhibit to ensure robustness against disturbances. However, as the volume and dimensionality of sensory-feedback increase it might be difficult to manually design a multimodal-data fusion system that can handle heterogeneous data. Nowadays, multi-modal machine learning is an emerging field with research focused mainly on analyzing vision and audio information. Although, from the robotics perspective, haptic sensations experienced from interaction with an environment are essential to successfully execute useful tasks. In our work, we compared four learning-based multi-modal fusion methods on three publicly available datasets containing haptic signals, images, and robots’ poses. During tests, we considered three tasks involving such data, namely grasp outcome classification, texture recognition, and—most challenging—multi-label classification of haptic adjectives based on haptic and visual data. Conducted experiments were focused not only on the verification of the performance of each method but mainly on their robustness against data degradation. We focused on this aspect of multi-modal fusion, as it was rarely considered in the research papers, and such degradation of sensory feedback might occur during robot interaction with its environment. Additionally, we verified the usefulness of data augmentation to increase the robustness of the aforementioned data fusion methods.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/7/1152/pdf

Reference42 articles.

1. Hearing lips and seeing voices

2. Multimodal Machine Learning: A Survey and Taxonomy

3. GelSight: High-Resolution Robot Tactile Sensors for Estimating Geometry and Force

4. Tracking objects with point clouds from vision and touch

5. Biomimetic Tactile Sensor Array

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A comprehensive review of robot intelligent grasping based on tactile perception;Robotics and Computer-Integrated Manufacturing;2024-12

2. A systematic literature review of computer vision applications in robotized wire harness assembly;Advanced Engineering Informatics;2024-10

3. A Novel Visuo-Tactile Object Recognition Pipeline using Transformers with Feature Level Fusion;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. Sync or Sink? The Robustness of Sensor Fusion Against Temporal Misalignment;2024 IEEE 30th Real-Time and Embedded Technology and Applications Symposium (RTAS);2024-05-13

5. RH20T: A Comprehensive Robotic Dataset for Learning Diverse Skills in One-Shot;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13