An Underwater Human–Robot Interaction Using a Visual–Textual Model for Autonomous Underwater Vehicles-Reference-Cited by-同舟云学术

An Underwater Human–Robot Interaction Using a Visual–Textual Model for Autonomous Underwater Vehicles

Published:2022-12-24 Issue:1 Volume:23 Page:197
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zhang Yongji^ORCID,Jiang Yu^ORCID,Qi Hong,Zhao Minghao,Wang Yuehang,Wang Kai,Wei Fenglin

Abstract

The marine environment presents a unique set of challenges for human–robot interaction. Communicating with gestures is a common way for interacting between the diver and autonomous underwater vehicles (AUVs). However, underwater gesture recognition is a challenging visual task for AUVs due to light refraction and wavelength color attenuation issues. Current gesture recognition methods classify the whole image directly or locate the hand position first and then classify the hand features. Among these purely visual approaches, textual information is largely ignored. This paper proposes a visual–textual model for underwater hand gesture recognition (VT-UHGR). The VT-UHGR model encodes the underwater diver’s image as visual features, the category text as textual features, and generates visual–textual features through multimodal interactions. We guide AUVs to use image–text matching for learning and inference. The proposed method achieves better performance than most existing purely visual methods on the dataset CADDY, demonstrating the effectiveness of using textual patterns for underwater gesture recognition.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/1/197/pdf

Reference48 articles.

1. A Survey of Underwater Human-Robot Interaction (U-HRI);Birk;Curr. Robot. Rep.,2022

2. Mišković, N., Egi, M., Nad, D., Pascoal, A., Sebastiao, L., and Bibuli, M. (September, January 30). Human-robot interaction underwater: Communication and safety requirements. Proceedings of the 2016 IEEE Third Underwater Communications and Networking Conference (UComms), Lerici, Italy.

3. Sun, K., Cui, W., and Chen, C. (2021). Review of Underwater Sensing Technologies and Applications. Sensors, 21.

4. A Kinect-Based Real-Time Compressive Tracking Prototype System for Amphibious Spherical Robots;Pan;Sensors,2015

5. Qin, R., Zhao, X., Zhu, W., Yang, Q., He, B., Li, G., and Yan, T. (2021). Multiple Receptive Field Network (MRF-Net) for Autonomous Underwater Vehicle Fishing Net Detection Using Forward-Looking Sonar Images. Sensors, 21.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. YOLOv8-MU: An Improved YOLOv8 Underwater Detector Based on a Large Kernel Block and a Multi-Branch Reparameterization Module;Sensors;2024-05-01

2. An Image-Text Matching Method for Multi-Modal Robots;Journal of Organizational and End User Computing;2023-12-08

3. Towards Multi-AUV Collaboration and Coordination: A Gesture-Based Multi-AUV Hierarchical Language and a Language Framework Comparison System;Journal of Marine Science and Engineering;2023-06-10