A Combined Approach for the Binarization of Historical Tibetan Document Images

Author:

Han Yuehui 1,2; Wang Weilan 1; Liu Huaming 3; Wang Yiqun 1

Affiliation:

1. Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, P. R. China

2. School of Mathematics and Computer Science, Northwest Minzu University, Lanzhou, P. R. China

3. Computer and Information School, Fuyang Normal College, Fuyang, P. R. China

Abstract

Historical Tibetan documents held in historical collections are often poorly preserved and prone to degradation. Degradation creates many problems that image binarization can address, the most common of which is staining. The lack of a uniform standard dataset also makes binarization results difficult to evaluate. Motivated by both the poor results and the difficulty of evaluating the binarization of historical Tibetan document images, a combined approach is proposed that aims to improve overall performance. The method consists of the following parts. First, images are generated by standard binarization and by background extraction from color images; both outputs are used to synthesize images in preparation for evaluation. Then, preliminary binarization is performed through channel combination in the Lab color space and through local binarization; the synthetic images are used to select the coefficient for the channel combination. Next, Local Binary Pattern (hereafter LBP) computation and image smoothing are carried out after the channel combination to obtain the outline of the text area. Finally, the final binarized image is obtained by combining the preliminary binarized image with the text area contour image. In tests on a large number of synthetic images with a variety of background types, our method achieved the best performance among the methods compared.
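The abstract does not give the formulas, so the following is only a minimal sketch of three of the steps it names: weighted channel combination, local binarization, and choosing the combination weights against a synthetic image with known ground truth. Everything specific here is an assumption made for illustration: the function names, the linear weighted sum, the local-mean threshold rule, and the use of F-measure as the selection score. The channel planes are assumed to be already converted to the Lab color space, and the text-area mask is assumed to come from a separate LBP and smoothing stage that is not shown.

```python
from typing import List, Sequence, Tuple

Plane = List[List[float]]   # one image channel as rows of pixel values
Binary = List[List[int]]    # 1 = text pixel, 0 = background

def combine_channels(l_ch: Plane, a_ch: Plane, b_ch: Plane,
                     weights: Tuple[float, float, float]) -> Plane:
    """Per-pixel weighted sum of three channel planes (assumed linear form)."""
    wl, wa, wb = weights
    return [[wl * l + wa * a + wb * b for l, a, b in zip(rl, ra, rb)]
            for rl, ra, rb in zip(l_ch, a_ch, b_ch)]

def local_mean_binarize(gray: Plane, radius: int = 1, offset: float = 0.0) -> Binary:
    """Mark a pixel as text when it is darker than its window mean minus offset."""
    h, w = len(gray), len(gray[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [gray[j][i]
                      for j in range(max(0, y - radius), min(h, y + radius + 1))
                      for i in range(max(0, x - radius), min(w, x + radius + 1))]
            mean = sum(window) / len(window)
            out[y][x] = 1 if gray[y][x] < mean - offset else 0
    return out

def f_measure(pred: Binary, truth: Binary) -> float:
    """Harmonic mean of precision and recall over text pixels."""
    pairs = [(p, t) for rp, rt in zip(pred, truth) for p, t in zip(rp, rt)]
    tp = sum(1 for p, t in pairs if p and t)
    fp = sum(1 for p, t in pairs if p and not t)
    fn = sum(1 for p, t in pairs if t and not p)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def select_weights(l_ch: Plane, a_ch: Plane, b_ch: Plane, truth: Binary,
                   candidates: Sequence[Tuple[float, float, float]]
                   ) -> Tuple[float, float, float]:
    """Pick the candidate weights that score best on a synthetic image."""
    return max(candidates, key=lambda w: f_measure(
        local_mean_binarize(combine_channels(l_ch, a_ch, b_ch, w)), truth))

def merge_with_text_mask(binary: Binary, text_mask: Binary) -> Binary:
    """Keep text pixels only inside the text-area mask (assumed logical AND)."""
    return [[p & m for p, m in zip(rb, rm)] for rb, rm in zip(binary, text_mask)]
```

A tiny usage example: on a 3x3 patch whose only dark pixel is the center, `select_weights` prefers a weight vector that keeps the lightness channel over one that discards it, because only the former recovers the ground-truth text pixel.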

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Software

Cited by 10 articles.
