A Survey on Document Image Processing Methods Useful for Assistive Technology for the Blind

Published: 2015-01 Issue: 01 Volume: 15 Page: 1550005
ISSN: 0219-4678
Container-title: International Journal of Image and Graphics
Language: en
Short-container-title: Int. J. Image Grap.

Author:

Keefer Robert¹, Bourbakis Nikolaos²

Affiliation:

1. Pomiet, LLC, Dayton, OH 45449, USA

2. Wright State University, Dayton, OH 45449, USA

Abstract

This paper reviews state-of-the-art document image processing methods and their classification, identifying new trends in automatic document processing and understanding. Document image processing (DIP) is an important problem related to most of the challenges arising in the image processing field, with applications to digital document summarization, readers for the visually impaired, etc. Difficulties in processing documents can arise from lighting conditions, page curl, page rotation in 3D, and page layout segmentation. Document image processing is usually performed in the context of higher-level applications that require an undistorted document image, such as optical character recognition and document restoration/preservation. Typically, assumptions are made to constrain the processing problem in the context of a particular application. In this survey, we categorize document image processing methods on the basis of technique, provide detailed descriptions of representative methods in each category, and examine their pros and cons. It is important to note that the DIP field is broad; we therefore aim to provide a top-down/horizontal survey rather than a bottom-up one. At the same time, we target the area of document readers for the blind and use this application to guide a top-down survey of DIP. Moreover, we present a comparative survey based on important aspects of a marketable system that depends on document image processing techniques.

Publisher

World Scientific Pub Co Pte Lt

Subject

Computer Graphics and Computer-Aided Design, Computer Science Applications, Computer Vision and Pattern Recognition

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219467815500059

References: 34 articles.

1. Camera-based analysis of text and documents: a survey

2. Building cameras for capturing documents

3. Example-based single document image super-resolution: a global MAP approach with outlier rejection

4. A Threshold Selection Method from Gray-Level Histograms

Cited by 10 articles.

1. DNN-HHOA: Deep Neural Network Optimization-Based Tabular Data Extraction from Compound Document Images;International Journal of Image and Graphics;2024-01-23

2. Behavioral analysis of bar charts in documents via stochastic petri-net modeling;Pattern Recognition Letters;2023-12

3. An innovative document image binarization approach driven by the non-local p-Laplacian;EURASIP Journal on Advances in Signal Processing;2022-06-18

4. End to End Invoice Processing Application Based on Key Fields Extraction;IEEE Access;2022

5. Detection and Correction of Multi-Warping Document Image;International Journal of Image and Graphics;2021-07-30