A Multi-Stage Method for Logo Detection in Scanned Official Documents Based on Image Processing
Published: 2024-04-22
Issue: 4
Volume: 17
Page: 170
ISSN: 1999-4893
Container-title: Algorithms
language: en
Short-container-title: Algorithms
Author:
Guijarro María1, Bayon Juan1, Martín-Carabias Daniel2, Recas Joaquín1
Affiliation:
1. Department of Computer Architecture and Automation, Complutense University of Madrid, 28040 Madrid, Spain
2. PSPDFKit GmbH, 28040 Madrid, Spain
Abstract
A logotype is a rectangular region defined by a set of characteristics, derived from pixel information and region shape, that differ from those of text. In this paper, a new method for automatic logo detection is proposed and tested using the public Tobacco800 database. Given an official document, our method outputs a set of regions with a high probability of containing a logo, using a new approach based on a variation of the feature rectangles method available in the literature. Candidate regions were computed using the longest increasing run algorithm over the indices of the document's blank lines. Those regions were further refined using a feature-rectangle-expansion method with forward checking, in which the rectangle expansion can occur in parallel in each region. Finally, a C4.5 decision tree was trained and tested against a set of 1291 official documents to evaluate its performance. The strategic combination of these three steps yields a precision and recall for logo detection of 98.9% and 89.9%, respectively, and is resistant to noise and low-quality documents. The method is also able to reduce the processing area of the document while maintaining a low percentage of false negatives.
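The candidate-region step can be illustrated with a minimal sketch. The abstract does not define the longest increasing run algorithm precisely, so the code below makes assumptions: a "blank line" is taken to be a pixel row with no ink, and a "run" is a maximal sequence of blank-row indices that increase by exactly 1. The function names (`blank_row_indices`, `consecutive_runs`, `candidate_bands`) and the `min_gap` parameter are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Image = List[List[int]]  # binarized page: 1 = ink pixel, 0 = background


def blank_row_indices(image: Image, max_ink: int = 0) -> List[int]:
    """Return indices of rows containing at most max_ink ink pixels."""
    return [y for y, row in enumerate(image) if sum(row) <= max_ink]


def consecutive_runs(indices: List[int]) -> List[Tuple[int, int]]:
    """Split a sorted index list into maximal runs increasing by exactly 1.

    Each run is returned as an inclusive (first, last) pair.
    """
    runs: List[Tuple[int, int]] = []
    for idx in indices:
        if runs and idx == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], idx)  # extend the current run
        else:
            runs.append((idx, idx))  # start a new run
    return runs


def candidate_bands(image: Image, min_gap: int = 2) -> List[Tuple[int, int]]:
    """Return row bands of content separated by blank runs of >= min_gap rows.

    Assumption: shorter blank runs (e.g. gaps between text lines) do not
    split a band; only wide blank runs act as separators.
    """
    height = len(image)
    separators = [
        (first, last)
        for first, last in consecutive_runs(blank_row_indices(image))
        if last - first + 1 >= min_gap
    ]
    bands: List[Tuple[int, int]] = []
    start = 0
    for first, last in separators:
        if first > start:
            bands.append((start, first - 1))
        start = last + 1
    if start < height:
        bands.append((start, height - 1))
    return bands


if __name__ == "__main__":
    page = [
        [0, 1, 1, 0],  # row 0: ink
        [0, 1, 0, 0],  # row 1: ink
        [0, 0, 0, 0],  # rows 2-4: blank
        [0, 0, 0, 0],
        [0, 0, 0, 0],
        [1, 1, 1, 1],  # row 5: ink
        [0, 0, 0, 0],  # row 6: single blank row, too short to separate
        [1, 0, 0, 1],  # row 7: ink
    ]
    runs = consecutive_runs(blank_row_indices(page))
    longest = max(runs, key=lambda r: r[1] - r[0])
    print("longest blank run:", longest)
    print("candidate bands:", candidate_bands(page))
```

In this toy page, the widest blank run splits the rows into an upper and a lower band, each of which would then be a candidate for the later refinement and classification stages. The same idea could be applied to columns within each band to obtain rectangular regions.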
Funder
Spanish Ministry of Science and Innovation