1. Clip-adapter: Better vision-language models with feature adapters;gao;ArXiv Preprint,2021
2. Visualizing data using t-sne;van der maaten;Journal of Machine Learning Research,2008
3. Jensen-shannon diver-gence and hilbert space embedding;fuglede;International Symposium onInformation Theory,0
4. CLIP4Caption: CLIP for Video Caption
5. A kernel two-sample test;gretton;The Journal of Machine Learning Research,2012