With the rapid development of artificial intelligence and deep learning, image-text matching has gradually become an important research topic in the cross-modal field. Correct image-text matching requires a strong understanding of the correspondence between visual and textual information, and in recent years, deep learning-based methods have achieved significant success on this task. However, image-text matching demands both a deep understanding of intra-modal information and the exploration of fine-grained alignment between image regions and textual words, and how to integrate these two aspects into a single model remains a challenge. In addition, reducing the internal complexity of the model and effectively constructing and utilizing prior knowledge are areas worth exploring. This work therefore addresses two shortcomings of existing fine-grained matching methods: their excessive computational complexity and their lack of multi-perspective matching.