1. Enhancing Video Retrieval with Robust CLIP-Based Multimodal System
2. Multimodal fusion in newsimages 2023: Evaluating translators, keyphrase extraction, and CLIP pre-training;Nguyen
3. A survey of diffusion based image generation models: Issues and their solutions;Zhang,2023
4. Yolov9: Learning what you want to learn using programmable gradient information;Wang,2024
5. Scb-dataset3: A benchmark for detecting student classroom behavior;Yang,2023