TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval-Reference-Cited by-同舟云学术

TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

Published:2022 Issue: Volume: Page:319-335
ISSN:0302-9743
Container-title:Lecture Notes in Computer Science
language:
Short-container-title:

Author:

Liu Yuqi^ORCID,Xiong Pengfei,Xu Luhui,Cao Shengming^ORCID,Jin Qin^ORCID

Publisher

Springer Nature Switzerland

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-19781-9_19

Reference51 articles.

1. Abernethy, J., Lee, C., Tewari, A.: Perturbation techniques in online learning and optimization. Perturbations, Optimization, and Statistics, p. 223 (2016)

2. Anne Hendricks, L., Wang, O., Shechtman, E., Sivic, J., Darrell, T., Russell, B.: Localizing moments in video with natural language. In: Proceedings of the IEEE international conference on computer vision, pp. 5803–5812 (2017)

3. Arnab, A., Dehghani, M., Heigold, G., Sun, C., Lučić, M., Schmid, C.: Vivit: A video vision transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6836–6846 (2021)

4. Bain, M., Nagrani, A., Varol, G., Zisserman, A.: Frozen in time: A joint video and image encoder for end-to-end retrieval. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1728–1738 (2021)

5. Bertasius, G., Wang, H., Torresani, L.: Is space-time attention all you need for video understanding. arXiv preprint arXiv:2102.05095 (2021)

Cited by 36 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Rethink video retrieval representation for video captioning;Pattern Recognition;2024-12

2. LSECA: local semantic enhancement and cross aggregation for video-text retrieval;International Journal of Multimedia Information Retrieval;2024-07-22

3. bjEnet: a fast and accurate software bug localization method in natural language semantic space;Software Quality Journal;2024-07-22

4. Multilevel Semantic Interaction Alignment for Video–Text Cross-Modal Retrieval;IEEE Transactions on Circuits and Systems for Video Technology;2024-07

5. Cliprerank: An Extremely Simple Method For Improving Ad-Hoc Video Search;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14