An Encoder–Decoder Architecture within a Classical Signal-Processing Framework for Real-Time Barcode Segmentation

Author:

Gómez-Cárdenes Óscar1ORCID,Marichal-Hernández José Gil1ORCID,Son Jung-Young2ORCID,Pérez Jiménez Rafael3ORCID,Rodríguez-Ramos José Manuel14

Affiliation:

1. Department of Industrial Engineering, Universidad de La Laguna, 38200 La Laguna, Spain

2. Biomedical Engineering Department, Konyang University, Nonsan-si 320-711, Republic of Korea

3. Institute for Technological Development and Innovation in Communications, Universidad de Las Palmas de Gran Canaria, 35017 Las Palmas, Spain

4. Research & Development Department, Wooptix S.L., 38204 La Laguna, Spain

Abstract

In this work, two methods are proposed for solving the problem of one-dimensional barcode segmentation in images, with an emphasis on augmented reality (AR) applications. These methods take the partial discrete Radon transform as a building block. The first proposed method uses overlapping tiles for obtaining good angle precision while maintaining good spatial precision. The second one uses an encoder–decoder structure inspired by state-of-the-art convolutional neural networks for segmentation while maintaining a classical processing framework, thus not requiring training. It is shown that the second method’s processing time is lower than the video acquisition time with a 1024 × 1024 input on a CPU, which had not been previously achieved. The accuracy it obtained on datasets widely used by the scientific community was almost on par with that obtained using the most-recent state-of-the-art methods using deep learning. Beyond the challenges of those datasets, the method proposed is particularly well suited to image sequences taken with short exposure and exhibiting motion blur and lens blur, which are expected in a real-world AR scenario. Two implementations of the proposed methods are made available to the scientific community: one for easy prototyping and one optimised for parallel implementation, which can be run on desktop and mobile phone CPUs.

Funder

Regional Ministry of Economy, Knowledge, and Employment

European Social Fund

Government of the Canary Islands

European Regional Development Fund

Research agreement on consumer electronics Wooptix-ULL, 2023

Korean government

Ministry of Education

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Reference56 articles.

1. (2022). GS1 General Specifications (Standard No. GS 22423-2023). Available online: https://www.gs1.org/docs/barcodes/GS1_General_Specifications.pdf.

2. Recent advances in augmented reality;Azuma;IEEE Comput. Graph. Appl.,2001

3. Muniz, R., Junco, L., and Otero, A. (November, January 31). A robust software barcode reader using the Hough transform. Proceedings of the 1999 International Conference on Information Intelligence and Systems, Bethesda, MD, USA.

4. Wachenfeld, S., Terlunen, S., and Jiang, X. (2010). Mobile Multimedia Processing: Fundamentals, Methods, and Applications, Springer.

5. Reading 1D Barcodes with Mobile Phones Using Deformable Templates;Gallo;IEEE Trans. Pattern Anal. Mach. Intell.,2011

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3