1. Vatt: Transformers for multimodal self-supervised learning from raw video, audio and text;Akbari;Advances in Neural Information Processing Systems,2021
2. Self-supervised multimodal versatile networks;Alayrac;Advances in Neural Information Processing Systems,2020
3. Tutorial on amortized optimization for learning to optimize over continuous domains;Amos,2022
4. Learning to learn by gradient descent by gradient descent;Andrychowicz;Advances in neural information processing systems,2016
5. Look, Listen and Learn