1. The Medical Article Records Groundtruth dataset.
http://marg.nlm.nih.gov/roverintro.asp
2. Beusekom, J.V.: Diploma thesis: Document layout analysis. Image Understanding and Pattern Recognition Group, Department of Computer Science, Month Unknown, pp. 1–67 (2006)
3. Cesarini, F., Lastri, M., Marinai, S., Soda, G.: Encoding of modified XY trees for document classification. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition, pp. 1131–1136. IEEE (2001)
4. Collins-Thompson, K., Nickolov, R.: A clustering-based algorithm for automatic document separation. In: SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping, Meaning, Tampere, Finland (2002)
5. Gao, H., Rusinol, M., Karatzas, D., Lladós, J.: Fast structural matching for document image retrieval through spatial databases. In: DRR, pp. 90,210N–90,210N (2014)