Affiliation:
1. College of Mathematics and Computer Science, Northwest Minzu University, Lanzhou City, P. R. China
Abstract
In this paper, we proposed a novel method for text line segmentation of Tibetan historical document image with uchen script based on contour tracking. Our method is mainly to segment the text lines from the image documents using the contour curve of the text lines, which consists of three parts: First, we calculate the barycentre coordinates of the connected components for the text regions, and then the barycentre of each text line is connected in order, so that the main part of each text line is connected and a new connected component is formed; then the contour curve of the connected component is obtained using the contour tracing algorithm; Second, the contour curve and the barycentre gravity are used to assign key elements (such as the syllable point, the upper vowel, the lower vowel, and the broken strokes and so on) of the text lines, and next the candidate text lines are obtained based on these connected components; Finally, the contour tracking algorithm is used to calculate the contour curve of the candidate text lines and segment the text lines. We evaluated our text line segmentation method on the 200 document image data sets. Experimental results show that the proposed method based on contour curve tracing can accurately segment the text lines of image documents and achieve the encouraging results.
Publisher
World Scientific Pub Co Pte Lt
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Software
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献