Breaking Boundaries Between Linguistics and Artificial Intelligence-Reference-Cited by-同舟云学术

Breaking Boundaries Between Linguistics and Artificial Intelligence

Published:2023-11-21 Issue:1 Volume:35 Page:1-20
ISSN:1546-2234
Container-title:Journal of Organizational and End User Computing
language:ng
Short-container-title:

Author:

Wang Jinhai¹,Tie Yi²,Jiang Xia¹,Xu Yilin³

Affiliation:

1. School of Foreign Languages, Zhengzhou University of Aeronautics, China

2. School of International Studies, Zhengzhou University, China

3. School of Business, Zhengzhou University of Aeronautics, China

Abstract

There is a wide connection between linguistics and artificial intelligence (AI), including the multimodal language matching. Multi-modal robots possess the capability to process various sensory modalities, including vision, auditory, language, and touch, offering extensive prospects for applications across various domains. Despite significant advancements in perception and interaction, the task of visual-language matching remains a challenging one for multi-modal robots. Existing methods often struggle to achieve accurate matching when dealing with complex multi-modal data, leading to potential misinterpretation or incomplete understanding of information. Additionally, the heterogeneity among different sensory modalities adds complexity to the matching process. To address these challenges, we propose an approach called vision-language matching with semantically aligned embeddings (VLMS), aimed at improving the visual-language matching performance of multi-modal robots.

Publisher

IGI Global

Subject

Strategy and Management,Computer Science Applications,Human-Computer Interaction

Reference54 articles.

1. Alshehri, H. A., Junath, N., & Panwar, P. (2022). Self-attention based edge computing model for synthesis image to text through next-generation AI mechanism. Mathematical Problems in Engineering.

2. Al Faraby, H., Azad, M. M., & Fedous, M. R. (2020). Image to Bengali caption generation using deep CNN and bidirectional gated recurrent unit. The 23rd international conference on computer and information technology (ICCIT). IEEE.

3. Deep Gated Multi-modal Learning: In-hand Object Pose Changes Estimation using Tactile and Image Data

4. The predicting public sentiment evolution on public emergencies under deep learning and internet of things

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Carbon Emission Reduction Effects of Heterogeneous Environmental Regulation: Evidence from the Firm Level;Ecological Chemistry and Engineering S;2024-06-01