1. Support-set bottlenecks for video-text representation learning;Patrick,2020
2. PyTorch: An imperative style, high-performance deep learning library;Paszke;Advances in Neural Information Processing Systems,2019
3. HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
4. Learning a text-video embedding from incomplete and heterogeneous data;Miech,2018
5. End-to-end learning of visual representations from uncurated instructional videos;Miech,2019