Effective Document Image Rectification via a Deep Learning Framework-Reference-Cited by-同舟云学术

Effective Document Image Rectification via a Deep Learning Framework

Published:2024-02 Issue:02 Volume:38 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Lin Hsiau-Wen¹^ORCID,Lin Hwei Jen²^ORCID,Tsai Yihjia²^ORCID,Tokuyama Yoshimasa³^ORCID,Kong Chou-Wei²^ORCID

Affiliation:

1. Department of Information Management, Chihlee University of Technology, Taipei, Taiwan

2. Department of Computer Science and Information Engineering, Tamkang University, Taipei, Taiwan

3. Department of Media and Image Technology, Faculty of Engineering, Tokyo Polytechnic University, Japan

Abstract

This paper proposes an efficient method for rectifying distorted document images via deep learning, ultimately improving the legibility of graphics and text in documents. The framework comprises two interconnected UNets, working in tandem to predict a 3D coordinate map and a forward map for the input distorted document image, respectively. At the beginning of the process, a page mask is predicted and used as input to both U-Nets to help improve the performance of their tasks. In the last step, the predicted forward map is transformed into a corresponding backward map, which is utilized to rectify the distorted image. The experimental results not only reveal that the predicted page masks and 3D coordinate maps significantly enhance the accuracy of predicting forward maps for subsequent rectification but also demonstrate satisfactory results both globally and locally.

Funder

National Science and Technology Council, Taiwan

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001423510230

Reference23 articles.

1. Geometric and shading correction for images of printed materials using boundary

2. DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks

3. The Common Fold

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Constructing an innovative system of management and education in colleges and universities based on artificial intelligence technology;Applied Mathematics and Nonlinear Sciences;2024-01-01