1. W. Duivesteijn, S. Hess, X. Du. How to cheat the page limit. WIREs Data Mining and Knowledge Discovery 10, Feb. 2020. https://doi.org/10.1002/widm.1361
2. D. Ginev. CorTeX: A general purpose processing framework for corpora of scientific documents. https://github.com/dginev/CorTeX
3. B. Miller, D. Ginev. LaTeXML: A LaTeX to XML converter. https://dlmf.nist.gov/LaTeXML/
4. H. Stamerjohanns. texmlbus: A build system to convert documents to XML and other formats. https://github.com/stamer/texmlbus
5. H. Stamerjohanns, M. Kohlhase. Transforming the arχiv to XML. In 9th International Conference, AISC 2008 15th Symposium, Calculemus 2008 7th International Conference, MKM 2008 Birmingham, UK, July 28–August 1, 2008, S. Autexier, J. Campbell, et al., eds., Intelligent Computer Mathematics, pp. 574–582. Springer Verlag, 2008.