1. Agrawal, A., Batra, D., Parikh, D., Kembhavi, A.: Don’t just assume; look and answer: overcoming priors for visual question answering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4971–4980 (2018)
2. Bengio, Y., Ducharme, R., Vincent, P., Janvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3(6), 1137–1155 (2003)
3. Carlson, L.A., Regier, T., Lopez, W., Corrigan, B.: Attention unites form and function in spatial language. Spat. Cogn. Comput. 6(4), 295–308 (2006)
4. Collell, G., Van Gool, L., Moens, M.F.: Acquiring common sense spatial knowledge through implicit spatial templates. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
5. Coventry, K.R.: In: Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence) (2005)