1. Learning transferable visual models from natural language supervision;radford;International Conference on Machine Learning,0
2. StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
3. Dynamic multimodal instance segmentation guided by natural language queries;margffoy-tuay;Proceedings of the European Conference on Computer Vision (ECCV),0
4. Generation and comprehension of unambiguous object descriptions;junhua;Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,0
5. Clip4clip: An empirical study of clip for end to end video clip retrieval;luo;ArXiv Preprint,2021