1. Alpay Tayfun. 2023. Multimodal video retrieval with CLIP: a user study. Information Retrieval Journal (2023).
2. Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
3. Choi Young Rok. 2020. Face video retrieval based on the deep CNN with RBF loss. IEEE Transactions on Image Processing (2020).
4. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
5. Zuolin Dong, Jiahong Wei, Xiaoyu Chen, and Pengfei Zheng. 2020. Face detection in security monitoring based on artificial intelligence video retrieval technology. IEEE Access 8 (2020), 63421–63433.