Author:
Madasu Avinash, Aflalo Estelle, Stan Gabriela Ben Melech, Rosenman Shachar, Tseng Shao-Yen, Bertasius Gedas, Lal Vasudev
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences, Information Systems
Reference
67 articles.
1. Amrani, E., Ben-Ari, R., Rotman, D., & Bronstein, A. (2021). Noise estimation using density estimation for self-supervised multimodal learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35, 6644–6652.
2. Anne Hendricks, L., Wang, O., Shechtman, E., Sivic, J., Darrell, T., & Russell, B. (2017). Localizing moments in video with natural language. In: Proceedings of the IEEE international conference on computer vision. 5803–5812.
3. Artetxe, M., Ruder, S., & Yogatama, D. (2019). On the cross-lingual transferability of monolingual representations.
4. Bain, M., Nagrani, A., Varol, G., & Zisserman, A. (2021). Frozen in time: A joint video and image encoder for end-to-end retrieval. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. 1728–1738.
5. Bertasius, G., Wang, H., & Torresani, L. (2021). Is space-time attention all you need for video understanding? In: ICML. 2, 4.
Cited by
1 article.