1. Dynamic memory networks for visual and textual question answering[C];Xiong,2016
2. Vqa: Visual question answering[C];Antol,2015
3. Bottom-up and top-down attention for image captioning and visual question answering[C];Anderson,2018
4. Ask me anything: Dynamic memory networks for natural language processing[C];Kumar,2016