Abstract
The paper presents a modified method for separating text lines in handwritten manuscripts. Such an approach is needed for the analysis of medieval texts, where multiple text lines overlap and are written at different angles. The proposed approach divides bounding boxes into smaller components at the points where character curves intersect. The method accounts for skewed text lines, producing non-rectangular zones between neighboring lines.
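The box-splitting idea can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `split_box_at_intersections`, the `(left, top, right, bottom)` box layout, and the choice to cut horizontally at the height of each intersection point are all assumptions made for illustration. The abstract does not specify how the cuts are placed, and the handling of skewed lines (non-rectangular zones) is not covered here.

```python
from typing import List, Tuple

# Hypothetical representations, not taken from the paper.
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), with top < bottom
Point = Tuple[int, int]          # (x, y) of a character-curve intersection


def split_box_at_intersections(box: Box, points: List[Point]) -> List[Box]:
    """Split a bounding box into stacked sub-boxes at intersection heights.

    Assumption: each intersection point inside the box marks a place where
    strokes of two neighboring lines touch, so the box is cut horizontally
    at that y-coordinate.
    """
    left, top, right, bottom = box
    # Keep only cut heights that fall strictly inside the box; drop duplicates.
    cuts = sorted(
        {y for x, y in points if left <= x <= right and top < y < bottom}
    )
    edges = [top] + cuts + [bottom]
    # Consecutive edge pairs define the smaller components.
    return [(left, edges[i], right, edges[i + 1]) for i in range(len(edges) - 1)]


if __name__ == "__main__":
    # The third point lies outside the box horizontally and is ignored.
    parts = split_box_at_intersections((0, 0, 100, 60), [(40, 20), (70, 45), (200, 30)])
    for part in parts:
        print(part)
```

A box with no interior intersection points is returned unchanged as a single component, so the function can be applied uniformly to every bounding box on a page.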
Publisher
Warsaw University of Life Sciences - SGGW Press
Subject
Computer Graphics and Computer-Aided Design, Computer Vision and Pattern Recognition, Software