Author:
Pramanick Shraman,Nowara Ewa M.,Gleason Joshua,Castillo Carlos D.,Chellappa Rama
Publisher
Springer Nature Switzerland
Reference80 articles.
1. Akbari, H., et al.: VATT: transformers for multimodal self-supervised learning from raw video, audio and text. In: Advances in Neural Information Processing Systems, vol. 34, pp. 24206–24221 (2021)
2. Lecture Notes in Computer Science;G Baatz,2012
3. Berton, G., Masone, C., Caputo, B.: Rethinking visual geo-localization for large-scale applications. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4878–4888 (2022)
4. Brejcha, J., Čadík, M.: State-of-the-art in visual geo-localization. Pattern Anal. Appl. 20(3), 613–637 (2017)
5. Cao, L., Smith, J.R., Wen, Z., Yin, Z., Jin, X., Han, J.: Bluefinder: estimate where a beach photo was taken. In: Proceedings of the 21st International Conference on World Wide Web, pp. 469–470 (2012)
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献