1. The role of social intelligence in personal development;Avlaev;JournalNX,2020
2. wav2vec 2.0: A framework for self-supervised learning of speech representations;Baevski;Advances in neural information processing systems,2020
3. A simple framework for contrastive learning of visual representations;Chen
4. Bert: Pre-training of deep bidirectional transformers for language understanding;Devlin,2018
5. MIST : Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering