Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications, Hardware and Architecture, Media Technology, Software