Publisher
Springer Nature Switzerland