Picture it in your mind: generating high level visual representations from textual descriptions-Reference-Cited by-同舟云学术

Picture it in your mind: generating high level visual representations from textual descriptions

Published:2017-10-14 Issue:2-3 Volume:21 Page:208-229
ISSN:1386-4564
Container-title:Information Retrieval Journal
language:en
Short-container-title:Inf Retrieval J

Author:

Carrara Fabio,Esuli Andrea^ORCID,Fagni Tiziano,Falchi Fabrizio,Moreo Fernández Alejandro

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Information Systems

Link

http://link.springer.com/article/10.1007/s10791-017-9318-6/fulltext.html

Reference47 articles.

1. Bai, Y., Yu, W., Xiao, T., Xu, C., Yang, K., Ma, W.-Y., & Zhao, T. (2014). Bag-of-words based deep neural network for image retrieval. In Proceedings of the ACM international conference on multimedia (pp. 229–232). ACM.

2. Cappallo, S., Mensink, T., & Snoek, C. G. (2015). Image2emoji: Zero-shot emoji prediction for visual media. In Proceedings of the 23rd ACM international conference on multimedia, MM ’15 (pp. 1311–1314). New York, NY: ACM.

3. Chen, X., Fang, H., Lin, T.-Y., Vedantam, R., Gupta, S., Dollár, P., & Zitnick, C. L. (2015). Microsoft coco captions: Data collection and evaluation server. arXiv preprint arXiv:1504.00325 .

4. Cheng, H.-T., Koc, L., Harmsen, J., Shaked, T., Chandra, T., Aradhye, H., Anderson, G., Corrado, G., Chai, W., Ispir, M., et al. (2016). Wide & deep learning for recommender systems. In Proceedings of the 1st workshop on deep learning for recommender systems (pp. 7–10). ACM.

5. Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder–decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 .

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tri-factorized Modular Hypergraph Autoencoder for Multimodal Semantic Analysis;SN Computer Science;2024-09-09

2. RobustFace: a novel image restoration technique for face adversarial robustness improvement;Multimedia Tools and Applications;2024-05-06

3. AdvFAS: A robust face anti-spoofing framework against adversarial examples;Computer Vision and Image Understanding;2023-10

4. Text-to-Motion Retrieval: Towards Joint Understanding of Human Motion Data and Natural Language;Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval;2023-07-18

5. Bottom-Up Transformer Reasoning Network for Text-Image Retrieval;Communications in Computer and Information Science;2023