Attention Enhanced Single Stage Multimodal Reasoner-Reference-Cited by-同舟云学术

Attention Enhanced Single Stage Multimodal Reasoner

Published:2020 Issue: Volume: Page:51-61
ISSN:0302-9743
Container-title:Computer Vision – ECCV 2020 Workshops
language:
Short-container-title:

Author:

Ou Jie,Zhang Xinying

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-66096-3_5

Reference19 articles.

1. De Vries, H., Strub, F., Chandar, S., Pietquin, O., Larochelle, H., Courville, A.: Guesswhat?! visual object discovery through multi-modal dialogue. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5503–5512 (2017)

2. Deng, C., Wu, Q., Wu, Q., Hu, F., Lyu, F., Tan, M.: Visual grounding via accumulated attention. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7746–7755 (2018)

3. Deruyttere, T., Collell, G., Moens, M.F.: Giving commands to a self-driving car: a multimodal reasoner for visual grounding. arXiv preprint arXiv:2003.08717 (2020)

4. Deruyttere, T., Vandenhende, S., Grujicic, D., Van Gool, L., Moens, M.F.: Talk2car: Taking control of your self-driving car. arXiv preprint arXiv:1909.10838 (2019)

5. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. GPT-4 enhanced multimodal grounding for autonomous driving: Leveraging cross-modal attention with large language models;Communications in Transportation Research;2024-12

2. Compound facial expressions recognition approach using DCGAN and CNN;Multimedia Tools and Applications;2024-08-28

3. Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention;2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2022-10-23

4. Commands 4 Autonomous Vehicles (C4AV) Workshop Summary;Computer Vision – ECCV 2020 Workshops;2020