1. L. Blecher, G. Cucurull, et al. Nougat: Neural optical understanding for academic documents, 2023. https://arxiv.org/abs/2308.13418.
2. S. Brinn, C. Cameron, et al. A framework for improving the accessibility of research papers on arxiv.org, 2024. https://arxiv.org/abs/2212.07286.
3. D. Cervone. MathJax: a platform for mathematics on the web. Notices of the AMS, 59(2):312–316, 2012.
4. C. Duan, Z. Tan, S. Bartsch. LaTeX rainbow: Universal LaTeX to PDF document semantic & layout annotation framework. In Proceedings of the Second Workshop on Information Extraction from Scientific Publications, T. Ghosal, F. Grezes, et al., eds., pp. 56–67, Bali, Indonesia, Nov. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.wiesp-1.8
5. U. Fischer. On the road to Tagged PDF: About StructElem, marked content, PDF/A and squeezed Bärs. TUGboat 42(2):170–173, 2021. https://doi.org/10.47397/tb/42-2/tb131fischer-tagpdf