Enhancing Document Image Retrieval in Education: Leveraging Ensemble-Based Document Image Retrieval Systems for Improved Precision

Author:

Alzoubi Yehia Ibrahim1ORCID,Topcu Ahmet Ercan2,Ozdemir Erdem3

Affiliation:

1. College of Business Administration, American University of the Middle East, Egaila 54200, Kuwait

2. College of Engineering and Technology, American University of the Middle East, Egaila 54200, Kuwait

3. Booking Holdings Inc., 1000 BP Amsterdam, The Netherlands

Abstract

Document image retrieval (DIR) systems simplify access to digital data within printed documents by capturing images. These systems act as bridges between print and digital realms, with demand in organizations handling both formats. In education, students use DIR to access online materials, clarify topics, and find solutions in printed textbooks by photographing content with their phones. DIR excels in handling complex figures and formulas. We propose using ensembles of DIR systems instead of single-feature models to enhance DIR’s efficacy. We introduce “Vote-Based DIR” and “The Strong Decision-Based DIR”. These ensembles combine various techniques, like optical code reading, spatial analysis, and image features, improving document retrieval. Our study, using a dataset of university exam preparation materials, shows that ensemble DIR systems outperform individual ones, promising better accuracy and efficiency in digitizing printed content, which is especially beneficial in education.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference38 articles.

1. Deep learned vectors’ formation using auto-correlation, scaling, and derivations with CNN for complex and huge image retrieval;Naeem;Complex Intell. Syst.,2023

2. Content-based medical image retrieval by spatial matching of visual words;Shamna;J. King Saud Univ.-Comput. Inf. Sci.,2022

3. Efficient content-based image retrieval using deep search and rescue algorithm;Keisham;Soft Comput.,2022

4. Concentrated hashing with neighborhood embedding for image retrieval and classification;Mo;Int. J. Mach. Learn. Cybern.,2022

5. Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval;Gong;IEEE Trans. Pattern Anal. Mach. Intell.,2012

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3