1. Blostein, D., Grbavec, A.: Recognition of mathematical notation. In: Handbook of Character Recognition and Document Image Analysis, pp. 557–582. World Scientific (1997)
2. Chu, X., Tian, Z., Zhang, B., et al.: Conditional positional encodings for vision transformers. arXiv preprint arXiv:2102.10882 (2021)
3. Deng, Y., Kanervisto, A., Ling, J., et al.: Image-to-markup generation with coarse-to-fine attention. In: International Conference on Machine Learning, pp. 980–989. PMLR (2017)
4. Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
5. Fu, Y., Liu, T., Gao, M., et al.: EDSL: An encoder-decoder architecture with symbol-level features for printed mathematical expression recognition. arXiv preprint arXiv:2007.02517 (2020)