1. Armen Aghajanyan Bernie Huang Candace Ross Vladimir Karpukhin Hu Xu Naman Goyal Dmytro Okhonko Mandar Joshi Gargi Ghosh Mike Lewis et al. 2022. Cm3: A causal masked multimodal model of the internet. arXiv preprint arXiv:2201.07520 (2022).
2. Flamingo: a visual language model for few-shot learning;Alayrac Jean-Baptiste;Advances in Neural Information Processing Systems,2022
3. Shivangi Aneja, Chris Bregler, and Matthias Nießner. 2021a. Cosmos: Catching out-of-context misinformation with self-supervised learning. arXiv preprint arXiv:2101.06278 (2021).
4. Shivangi Aneja, Cise Midoglu, Duc-Tien Dang-Nguyen, Sohail Ahmed Khan, Michael Riegler, Pål Halvorsen, Chris Bregler, and Balu Adsumilli. 2022. Acm multimedia grand challenge on detecting cheapfakes. arXiv preprint arXiv:2207.14534 (2022).
5. Shivangi Aneja, Cise Midoglu, Duc-Tien Dang-Nguyen, Michael Alexander Riegler, Paal Halvorsen, Matthias Nießner, Balu Adsumilli, and Chris Bregler. 2021b. MMSys' 21 grand challenge on detecting cheapfakes. arXiv preprint arXiv:2107.05297 (2021).