Advanced Hough-based method for on-device document localization-Reference-Cited by-同舟云学术

Advanced Hough-based method for on-device document localization

Published:2021-09 Issue:45 Volume:5 Page:702-712
ISSN:2412-6179
Container-title:Computer Optics
language:ru
Short-container-title:

Author:

Tropin D.V.¹,Ershov A.M.²,Nikolaev D.P.³,Arlazarov V.V.⁴

Affiliation:

1. Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Russia; FRC CSC RAS, Moscow, Russia; LLC "Smart Engines Service", Moscow, Russia

2. Moscow State University, Moscow, Russia; LLC "Smart Engines Service", Moscow, Russia

3. Institute for Information Transmission Problems of the RAS (Kharkevich Institute), Moscow, Russia; LLC "Smart Engines Service", Moscow, Russia

4. FRC CSC RAS, Moscow, Russia; LLC "Smart Engines Service", Moscow, Russia

Abstract

The demand for on-device document recognition systems increases in conjunction with the emergence of more strict privacy and security requirements. In such systems, there is no data transfer from the end device to a third-party information processing servers. The response time is vital to the user experience of on-device document recognition. Combined with the unavailability of discrete GPUs, powerful CPUs, or a large RAM capacity on consumer-grade end devices such as smartphones, the time limitations put significant constraints on the computational complexity of the applied algorithms for on-device execution. In this work, we consider document location in an image without prior knowledge of the docu-ment content or its internal structure. In accordance with the published works, at least 5 systems offer solutions for on-device document location. All these systems use a location method which can be considered Hough-based. The precision of such systems seems to be lower than that of the state-of-the-art solutions which were not designed to account for the limited computational resources. We propose an advanced Hough-based method. In contrast with other approaches, it accounts for the geometric invariants of the central projection model and combines both edge and color features for document boundary detection. The proposed method allowed for the second best result for SmartDoc dataset in terms of precision, surpassed by U-net like neural network. When evaluated on a more challenging MIDV-500 dataset, the proposed algorithm guaranteed the best precision compared to published methods. Our method retained the applicability to on-device computations.

Funder

Russian Foundation for Basic Research

Publisher

Samara State National Research University

Subject

Electrical and Electronic Engineering,Computer Science Applications,Atomic and Molecular Physics, and Optics

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unfolder: fast localization and image rectification of a document with a crease from folding in half;COMPUT OPT;2024

2. Line segment detection algorithm in image extraction improvement study;Journal of Measurements in Engineering;2024-02-29

3. High-Performance Digital Image Processing;Pattern Recognition and Image Analysis;2023-12

4. Document Localization and Classification As Stages of a Document Recognition System;Pattern Recognition and Image Analysis;2023-12

5. A Robust Approach to Detect Occlusions During Camera-Based Document Scanning;2023 IEEE Latin American Conference on Computational Intelligence (LA-CCI);2023-10-29