Affiliation:
1. Artificial Intelligence and Applied Computer Science, University of Würzburg, 97074 Würzburg, Germany
Abstract
Digitization and transcription of historic documents offer new research opportunities for humanists and are the topics of many edition projects. However, manual work is still required for the main phases of layout recognition and the subsequent optical character recognition (OCR) of early printed documents. This paper describes and evaluates how deep learning approaches recognize text lines and can be extended to layout recognition using background knowledge. The evaluation was performed on five corpora of early prints from the 15th and 16th Centuries, representing a variety of layout features. While the main text with standard layouts could be recognized in the correct reading order with a precision and recall of up to 99.9%, also complex layouts were recognized at a rate as high as 90% by using background knowledge, the full potential of which was revealed if many pages of the same source were transcribed.
Funder
German Research Foundation
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference34 articles.
1. Antonacopoulos, A., Clausner, C., Papadopoulos, C., and Pletschacher, S. (2013, January 25–28). ICDAR 2013 Competition on Historical Book Recognition (HBR 2013). Proceedings of the 2013 12th International Conference on Document Analysis and Recognition, Washington, DC, USA.
2. Zhong, X., Tang, J., and Jimeno Yepes, A. (2019, January 20–25). PubLayNet: Largest Dataset Ever for Document Layout Analysis. Proceedings of the 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, Australia.
3. Najem-Meyer, S., and Romanello, M. (2022). Page Layout Analysis of Text-heavy Historical Documents: A Comparison of Textual and Visual Approaches. arXiv.
4. Jocher, G. (2022, November 10). YOLOv5 by Ultralytics. Available online: https://github.com/ultralytics/yolov5.
5. Beyond Document Object Detection: Instance-Level Segmentation of Complex Layouts;Biswas;Int. J. Doc. Anal. Recognit. (IJDAR),2021
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献