A multimodal multitask deep learning framework for vibrotactile feedback and sound rendering-Reference-Cited by-同舟云学术

A multimodal multitask deep learning framework for vibrotactile feedback and sound rendering

Published:2024-06-10 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Joolee Joolekha Bibi,Uddin Md Azher

Abstract

AbstractData-driven approaches are often utilized to model and generate vibrotactile feedback and sounds for rigid stylus-based interaction. Nevertheless, in prior research, these two modalities were typically addressed separately due to challenges related to synchronization and design complexity. To this end, we introduce a novel multimodal multitask deep learning framework. In this paper, we developed a comprehensive end-to-end data-driven system that encompasses the capture of contact acceleration signals and sound data from various texture surfaces. This framework introduces novel encoder-decoder networks for modeling and rendering vibrotactile feedback through an actuator while routing sound to headphones. The proposed encoder-decoder networks incorporate stacked transformers with convolutional layers to capture both local variability and overall trends within the data. To the best of our knowledge, this is the first attempt to apply transformer-based data-driven approach for modeling and rendering of vibrotactile signals as well as sounds during tool-surface interactions. In numerical evaluations, the proposed framework demonstrates a lower RMS error compared to state-of-the-art models for both vibrotactile signals and sound data. Additionally, subjective similarity evaluation also confirm the superiority of proposed method over state-of-the-art.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-64376-y.pdf

Reference33 articles.

1. Chan, S., Tymms, C., & Colonnese, N. Hasti: Haptic and audio synthesis for texture interactions. In Proceedings of the IEEE world haptics conference (WHC), Montreal, QC, Canada, pp. 733–738, (2021). https://doi.org/10.1109/WHC49131.2021.9517177.

2. Culbertson, H., Unwin, J. & Kuchenbecker, K. J. Modeling and rendering realistic textures from unconstrained tool-surface interactions. IEEE Trans. Haptics 7(3), 381–393 (2014).

3. Nai, W. et al. Vibrotactile feedback rendering of patterned textures using a waveform segment table method. IEEE Trans. Haptics 14(4), 849–861. https://doi.org/10.1109/TOH.2021.3084304 (2021).

4. Joolee, J. B. & Jeon, S. Data-driven haptic texture modeling and rendering based on deep spatio-temporal networks. IEEE Trans. Haptics 15(1), 62–67. https://doi.org/10.1109/TOH.2021.3137936 (2022).

5. Lu, S., Chen, Y. & Culbertson, H. Towards multisensory perception: Modeling and rendering sounds of tool-surface interactions. IEEE Trans. Haptics 13(1), 94–101. https://doi.org/10.1109/TOH.2020.2966192 (2020).