Exploring latent weight factors and global information for food-oriented cross-modal retrieval-Reference-Cited by-同舟云学术

Exploring latent weight factors and global information for food-oriented cross-modal retrieval

Published:2023-07-28 Issue:1 Volume:35 Page:
ISSN:0954-0091
Container-title:Connection Science
language:en
Short-container-title:Connection Science

Author:

Zhao Wenyu¹,Zhou Dong²,Cao Buqing¹,Liang Wei¹,Sukhija Nitin³

Affiliation:

1. School of Computer Science and Technology, Hunan University of Science and Technology, Xiangtan, People’s Republic of China

2. School of Information Science and Technology, Guangdong University of Foreign Studies, Guangzhou, People’s Republic of China

3. Department of Computer Science, Slippery Rock University of Pennsylvania, Slippery Rock, PA, USA

Funder

Scientific Research Fund of Hunan Provincial Education Department

the Hunan Provincial Natural Science Foundation of China

Hunan Provincial Innovation Foundation for Postgraduate

Publisher

Informa UK Limited

Subject

Artificial Intelligence,Human-Computer Interaction,Software

Link

https://www.tandfonline.com/doi/pdf/10.1080/09540091.2023.2233714

Reference47 articles.

1. Arevalo, J., Solorio, T., Montes-y-Gómez, M., & González, F. A. (2017). Gated multimodal units for information fusion. Proceedings of the 5th international conference on learning Representations (ICLR, Workshop), Toulon, France.

2. Cross-Modal Retrieval in the Cooking Context

3. Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval

4. Multimodal Encoders for Food-Oriented Cross-Modal Retrieval

5. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., & Gelly, S. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. Proceedings of the 9th international conference on learning representations (ICLR), online.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Disambiguity and Alignment: An Effective Multi-Modal Alignment Method for Cross-Modal Recipe Retrieval;Foods;2024-05-23