A Multi-Stage Method for Logo Detection in Scanned Official Documents Based on Image Processing

Author:

Guijarro María1ORCID,Bayon Juan1,Martín-Carabias Daniel2,Recas Joaquín1ORCID

Affiliation:

1. Department of Computer Architecture and Automation, Complutense University of Madrid, 28040 Madrid, Spain

2. PSPDFKit GmbH, 28040 Madrid, Spain

Abstract

A logotype is a rectangular region defined by a set of characteristics, which come from the pixel information and region shape, that differ from those of the text. In this paper, a new method for automatic logo detection is proposed and tested using the public Tobacco800 database. Our method outputs a set of regions from an official document with a high probability to contain a logo using a new approach based on the variation of the feature rectangles method available in the literature. Candidate regions were computed using the longest increasing run algorithm over the document blank lines’ indices. Those regions were further refined by using a feature-rectangle-expansion method with forward checking, where the rectangle expansion can occur in parallel in each region. Finally, a C4.5 decision tree was trained and tested against a set of 1291 official documents to evaluate its performance. The strategic combination of the three previous steps offers a precision and recall for logo detention of 98.9% and 89.9%, respectively, being also resistant to noise and low-quality documents. The method is also able to reduce the processing area of the document while maintaining a low percentage of false negatives.

Funder

Spanish Ministry of Science and Innovation

Publisher

MDPI AG

Reference43 articles.

1. Hoi, S.C., Wu, X., Liu, H., Wu, Y., Wang, H., Xue, H., and Wu, Q. (2015). Logo-net: Large-scale deep logo detection and brand recognition with deep region-based convolutional networks. arXiv.

2. Constantinopoulos, C., Meinhardt-Llopis, E., Liu, Y., and Caselles, V. (2011, January 11–15). A robust pipeline for logo detection. Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, Barcelona, Spain.

3. Elliptical asift agglomeration in class prototype for logo detection;Boia;BMVC,2015

4. Revaud, J., Douze, M., and Schmid, C. (2012, January 29). Correlation-based burstiness for logo retrieval. Proceedings of the 20th ACM International Conference on Multimedia, Nara, Japan.

5. Logo Detection With No Priors;Velazquez;IEEE Access,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3