A Review of Document Binarization: Main Techniques, New Challenges, and Trends-Reference-Cited by-同舟云学术

A Review of Document Binarization: Main Techniques, New Challenges, and Trends

Published:2024-04-07 Issue:7 Volume:13 Page:1394
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Yang Zhengxian¹,Zuo Shikai¹,Zhou Yanxi¹,He Jinlong¹,Shi Jianwen¹

Affiliation:

1. School of Opto-Electronic and Communication Engineering, Department of Microelectronics, Xiamen University of Technology, Xiamen 361024, China

Abstract

Document image binarization is a challenging task, especially when it comes to text segmentation in degraded document images. The binarization, as a pre-processing step of Optical Character Recognition (OCR), is one of the most fundamental and commonly used segmentation methods. It separates the foreground text from the background of the document image to facilitate subsequent image processing. In view of the different degradation degrees of document images, researchers have proposed a variety of solutions. In this paper, we have summarized some challenges and difficulties in the field of document image binarization. Approximately 60 methods documenting image binarization techniques are mentioned, including traditional algorithms and deep learning-based algorithms. Here, we evaluated the performance of 25 image binarization techniques on the H-DIBCO2016 dataset to provide some help for future research.

Funder

Natural Science Foundation of Fujian Province of China

Educational Teaching Reform Research Project of Xiamen University of Technology in 2022

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/7/1394/pdf

Reference124 articles.

1. Gatos, B., Pratikakis, I., Kepene, K., and Perantonis, S.J. (2024, March 26). Text Detection in Indoor/Outdoor Scene Images. Available online: https://www.researchgate.net/publication/253135219_Text_Detection_in_IndoorOutdoor_Scene_Images.

2. Pan, Y.F., Hou, X., and Liu, C.L. (2009, January 26–29). Text Localization in Natural Scene Images Based on Conditional Random Field. Proceedings of the 2009 10th International Conference on Document Analysis and Recognition, Barcelona, Spain.

3. Liao, M., Wan, Z., Yao, C., Chen, K., and Bai, X. (2019). Real-time Scene Text Detection with Differentiable Binarization. arXiv.

4. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation;Kamnitsas;Med. Image Anal.,2016

5. Atia, N., Benzaoui, A., Jacques, S., Hamiane, M., Kourd, K.E., Bouakaz, A., and Ouahabi, A. (2022). Particle Swarm Optimization and Two-Way Fixed-Effects Analysis of Variance for Efficient Brain Tumor Segmentation. Cancers, 14.