Visual–tactile fusion object classification method based on adaptive feature weighting-Reference-Cited by-同舟云学术

Visual–tactile fusion object classification method based on adaptive feature weighting

Published:2023-07-01 Issue:4 Volume:20 Page:
ISSN:1729-8806
Container-title:International Journal of Advanced Robotic Systems
language:en
Short-container-title:International Journal of Advanced Robotic Systems

Author:

Zhang Peng¹^ORCID,Bai Lu¹,Shan Dongri²,Wang Xiaofang¹,Li Shuang¹,Zou Wenkai¹,Chen Zhenxue³

Affiliation:

1. School of Information and Automation Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China

2. School of Mechanical Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China

3. School of Control Science and Engineering, Shandong University, Jinan, China

Abstract

Visual–tactile fusion information plays a crucial role in robotic object classification. The fusion module in existing visual–tactile fusion models directly splices visual and tactile features at the feature layer; however, for different objects, the contributions of visual features and tactile features to classification are different. Moreover, direct concatenation may ignore features that are more beneficial for classification and will also increase computational costs and reduce model classification efficiency. To utilize object feature information more effectively and further improve the efficiency and accuracy of robotic object classification, we propose a visual–tactile fusion object classification method based on adaptive feature weighting in this article. First, a lightweight feature extraction module is used to extract the visual and tactile features of each object. Then, the two feature vectors are input into an adaptive weighted fusion module. Finally, the fused feature vector is input into the fully connected layer for classification, yielding the categories and physical attributes of the objects. In this article, extensive experiments are performed with the Penn Haptic Adjective Corpus 2 public dataset and the newly developed Visual-Haptic Adjective Corpus 52 dataset. The experimental results demonstrate that for the public dataset Penn Haptic Adjective Corpus 2, our method achieves a value of 0.9750 in terms of the area under the curve. Compared with the highest area under the curve obtained by the existing state-of-the-art methods, our method improves by 1.92%. Moreover, compared with the existing state-of-the-art methods, our method achieves the best results in training time and inference time; while for the novel Visual-Haptic Adjective Corpus 52 dataset, our method achieves values of 0.9827 and 0.9850 in terms of the area under the curve and accuracy metrics, respectively. Furthermore, the inference time reaches 1.559 s/sheet, demonstrating the effectiveness of the proposed method.

Funder

the National College Students Innovation and Entrepreneurship Training Program at Qilu University of Technology

Publisher

SAGE Publications

Subject

Artificial Intelligence,Computer Science Applications,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/17298806231191947

Reference35 articles.

1. Current Researches and Future Development Trend of Intelligent Robot: A Review

2. Editorial: ViTac: Integrating Vision and Touch for Multimodal and Cross-Modal Perception

3. Object recognition combining vision and touch

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multimodal tactile sensing fused with vision for dexterous robotic housekeeping;Nature Communications;2024-08-11