1. Zhu, R.: Enhance multimodal transformer with external label and in-domain pretrain: hateful meme challenge winning solution (2020)
2. Muennighoff, N.: Vilio: state-of-the-art visio-linguistic models applied to hateful memes (2020)
3. Velioglu, R., Rose, J.: Detecting hate speech in memes using multimodal deep learning approaches: prizewinning solution to hateful memes challenge (2020)
4. Lippe, P., et al.: A multimodal framework for the detection of hateful memes (2020)
5. Li, L.H., Yatskar, M., Yin, D., Hsieh, C.-J., Chang, K-W.: VisualBERT: a simple and performant baseline for vision and language (2019)