Author:
Burns Andrea, Kim Donghyun, Wijaya Derry, Saenko Kate, Plummer Bryan A.
Publisher
Springer International Publishing
Reference 44 articles.
1. Aharoni, R., Johnson, M., Firat, O.: Massively multilingual neural machine translation. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (June 2019)
2. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)
3. Antol, S., et al.: VQA: visual question answering. In: The IEEE International Conference on Computer Vision (ICCV) (2015)
4. Arora, S., Liang, Y., Ma, T.: A simple but tough-to-beat baseline for sentence embeddings. In: International Conference on Learning Representations (ICLR) (2017)
5. Artetxe, M., Labaka, G., Agirre, E.: Learning principled bilingual mappings of word embeddings while preserving monolingual invariance. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 2289–2294 (2016)
Cited by
6 articles.
1. MuMUR: Multilingual Multimodal Universal Retrieval;Information Retrieval Journal;2023-09-25
2. Teaching Structured Vision & Language Concepts to Vision & Language Models;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06
3. Improving Video Retrieval Using Multilingual Knowledge Transfer;Lecture Notes in Computer Science;2023
4. Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning;Proceedings of the 30th ACM International Conference on Multimedia;2022-10-10
5. Token Embeddings Alignment for Cross-Modal Retrieval;Proceedings of the 30th ACM International Conference on Multimedia;2022-10-10