1. Anderson, P., He, X., Buehler, C., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: CVPR, pp. 6077–6086 (2018)
2. Blostein, D., Grbavec, A.: Recognition of mathematical notation. In: Handbook of Character Recognition and Document Image Analysis, pp. 557–582. World Scientific (1997)
3. Chen, X., Ma, L., Jiang, W., Yao, J., Liu, W.: Regularizing RNNs for caption generation by reconstructing the past with the present. In: CVPR, pp. 7995–8003 (2018)
4. Deng, Y., Kanervisto, A., Ling, J., Rush, A.M.: Image-to-markup generation with coarse-to-fine attention. In: ICML, pp. 980–989. PMLR (2017)
5. Fu, Y., Liu, T., Gao, M., Zhou, A.: EDSL: an encoder-decoder architecture with symbol-level features for printed mathematical expression recognition. arXiv preprint arXiv:2007.02517 (2020)