1. Youtube-8m: A large-scale video classification benchmark;Abu-El-Haija,2016
2. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
3. VQA: Visual Question Answering
4. Testimages: a large-scale archive for testing visual devices and basic image processing algorithms;Asuni;STAG,2014
5. Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples;Athalye