1. VQA: Visual question answering;Antol,2015
2. Deep visual-semantic alignments for generating image descriptions;Karpathy,2015
3. Interpretable counting for visual question answering;Trott,2018
4. TallyQA: answering complex counting questions;Acharya,2019
5. Learning to count objects in natural images for visual question answering;Zhang,2018